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Part I 

Introduction 


In this part, we introduce monads and their algebras. Section contains the 
basic definitions: first illustrated on examples of finite words and oo-words, and 
then formally defined. Sections |3][^ show how some results about languages 
can be stated and proved on the level of monads, including: the Myhill-Nerode 
theorem (Section]^, Eilenberg’s pseudovariety theorem (Section]^, and some 
parts of the connection between regular languages and MSO (Sections and [^. 
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1 Introduction 


The principle behind algebraic language theory for various kinds of structures, 
such as words or trees, is to use a compositional function from the structures into 
a hnite set. To talk about compositionality, one needs some way of composing 
structures into bigger structures. It so happens that category theory has an 
abstract concept for this, namely a monad. The goal of this paper is to propose 
monads as a unifying framework for discussing existing algebras and designing 
new algebras. To introduce monads and their algebras, we begin with two 
examples, which use a monad style to present algebras for finite and infinite 
words. 


Example 1. Consider the following non-standard definition of a semigroup. 
Dehne a -I—algebra A to be a set A called its universe, together with a multi¬ 
plication operation mulA : A+ —)■ A, which is the identity on single letters, and 
which is associative in the sense that the following diagram commutes. 


(A+)+ 

(mulA)'''j 

A+ - 


UA 


mulA 


A+ 

|mulA 

-A 


In the diagram, (mulA)’*’ is the function that applies mulA to each label of a 
word where the alphabet is A '^, and is the function which flattens a word of 
words into a word, e.g. 

(abc)(aa)(acaa) i—)■ abcaaacaa. 

Restricting the multiplication operation in a -I—algebra to words of length two 
(the semigroup binary operation) is easily seen to be a one-to-one correspon¬ 
dence between -I—algebras and semigroups. □ 

The second example will be running example in the paper. 

Running Example 1. Let us define an algebra for infinite words in the spirit of 
the previous example. Define A°° to be the oo-words over A, i.e. Define 

an oo-algebra A to be a set A, called its universe, together with a multiplication 
operation mulA : A°° — >■ A, which is the identity on single letters, and which is 
associative in the sense that the following diagram commutes. 

(mulA)°° j 

A“- 

mulA 

In the diagram, (mulA)°° is the function that applies mulA to the label of every 
position in a oo-words where the alphabet is A°°, and iia flattens an oo-word 
of oo-words into an oo-word. If the argument of p,A contains an infinite word 
on some position, then all subsequent positions are ignored. 


jmulA 

A 
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An oo-algebra is essentially the same thing as an w-semigroup, see [PP04] . 
with the difference that w-semigroups have separate sorts for finite and infinite 
words. There is also a close connection with Wilke semigroups [WilQlj . which 
will be described as the running example develops. □ 

The similarities in the examples suggest that the should be an abstract no¬ 
tion of algebra, which would cover the examples and possibly other settings, 
e.g. trees. A closer look at the examples reveals that concepts of algebraic 
language theory such as “algebra”, “morphism”, “language”, “recognisable lan¬ 
guage” can be defined only in terms of the following four basic concepts (written 
below in the notation appropriate to -I—algebras): 

1. how a set A is transformed into a set A+; 

2. how a function / : A —>■ i? is lifted to a function /+ : A+ —)■ i?+; 

3. a flattening operation from (A+)+ —>■ A+; 

4. how to represent an element of A as an element of A+. 


These four concepts, subject to certain axioms, are what constitutes a monad, a 
fundamental concept in category theory (and recently, programming languages). 

The point of this paper is that, based on a monad one can also define things 
like: “syntactic algebra”, “pseudovariety”, “MSO logic”, “profinite object”, and 
even prove some theorems about them. Furthermore, monads as an abstraction 
cover practically every setting where algebraic language theory has been applied 
so far, including labelled scattered orderings [BR12) . labelled counta ble tota l 


orders [CCPll] , ranked trees |Ste92| , unranked trees |BW08) , preclones [EWOS 

The paper has three parts. 

Part I of this paper shows that several results of formal language theory can 
be stated and proved on the abstract level of monads, including: the Myhill- 
Nerode theorem on syntactic algebras (Section]^, the Eilenberg pseudovariety 
theorem (Section]^, or the Reiterman theorem (Section [TT|) on profinite identi¬ 
ties defining pseudovarieties. Another example is decidability of MSO (Section]^, 
although here monads only take care of the symbol-pushing part, leaving out 
the combinatorial part that is specific to individual monads, like applying the 
Ramsey theorem in the case of infinite words. When proving such generali¬ 
sations of classical theorems, one is naturally forced to have a closer look at 
notions such as “derivative of a language”, or “finite algebra”, which are used 
in the assumptions of the theorems. 

Part II includes shows how existing algebraic settings can be seen as a special 
case of monads. Part II also contains some new settings, illustrating how new 
kinds of algebras can be easily produced using monads. Specifically, Section 
describes a monad for words with a distinguished position, where standard the¬ 
orems and definitions come for free by virtue of being a monad. 

Part III, is devoted to profinite constructions. It is shown that every monad 
has a corresponding profinite monad, which, like any monad, has its own no¬ 
tion of recognisability, which does not reduce to recognisability in the original 
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monad. For example, the monad for finite words has a corresponding monad of 
profinite words, and recognisable languages of profinite words turn out to be a 
generalisation of languages of infinite words definable in the logic MSO+U. 

Thanks. I would like to thank Bartek Klin (who told me what a monad is), 
Szymon Torunczyk, Joost Winter and Marek Zawadowski for discussions on the 
subject. 

2 Monads and their algebras 

This paper uses only the most rudimentary notions of category theory: the 
definitions of a category (objects and composable morphisms between them), 
and of a functor (something that maps objects to objects and morphisms to 
morphisms in a way that is consistent with composition). Almost all examples 
in this paper use the category of sets, where objects are sets and morphisms are 
functions; or possibly the category of sorted sets, where objects are sorted sets 
for some fixed set of sort names, and morphisms are sort-preserving functions. 

A monad over a category is defined to be a functor T from the category to 
itself, and for every object X in the category, two morphisms 

r]x -.X and fix : TJX ^ TA, 

which are called the unit and multiplication operations. The monad must satisfy 
the axioms given in Figure In the language of sets, an intuition appropriate 
for this paper is that a monad inputs a set X, and produces the set of all 
“structures” whose “nodes” are labelled by elements of X. Depending on the 
monad, the structures could be words, or trees, or graphs, etc. The function 
Tfx inputs a label and produces a one-node structure that uses this label; while 
the function fix, which is the essence of the monad, flattens a structure of 
structures into a single structure. Basing on this intuition, we will use the name 
T -structures over X for elements of TX. 

We already saw two monads in Example and in the running example. 

For this paper, the most important thing about monads is that they have 
a natural corresponding notion of algebra. An Eilenberg-Moore algebra in a 
monad T, or simply T-algebra, is a pair A consisting of a universe A, which is 
an object in the category underlining the monad, together with a multiplication 
morphism 

mulA : TA —> A, 

such that the mulA o ?7 a is the identity, and which is associative in the sense 
that the following diagram commutes. 

TTA > TA 

TmulA I" I- mulA 

TA-^ A 

mulA 
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X 

Vx 

JX 


Tf 


Y 

Vy 

^TY 


TTf 

TJX ^ TTF 


t^x 

TX 


Tf 


Ml' 

Y 

TF 


TTTX ^ nx 


T)j.x 

TJX 


MX 


MX 

Y 

JX 



MX 


Figure 1: The axioms of a monad are that these four diagrams commute for 
every object X in the category and every morphism f : X ^ Y. The upper 
diagrams say that the unit and multiplication are natural. The lower left dia¬ 
gram says that multiplication is associative, and the lower right says that the 
unit is consistent with multiplication. 


Observe that this associativity is similar to the lower left axiom in Figure 
In fact, the lower left axiom in Figure and the upper half of the lower right 
axiom say that fix induces a T-algebra with universe TX. 

We use the convention that an algebra is denoted by a boldface letter, while 
its universe is written without boldface. A T -morphism between two T-algebras 
A and B is defined to be a function h between their universes which respects 
their multiplication operations in the sense that the following diagram com¬ 
mutes. 


JA — 

mulAj I mule 

A - 

h 

This completes the definition of monads and their algebras. 

Languages and colourings. To develop the basic definitions of recognisable 
languages over a monad, we require the following parameters, which we call the 
setting: the underlying category, the monad, a notion of finite alphabet, and a 
notion of finite T-algebra. So far, we do not place any restrictions on the notions 
of finiteness, e.g. when considering sets with infinitely many sorts, reasonable 
settings will often have finite algebras whose universe is not be finite in the same 
sense as a finite algebra. Actually, for some monads, it is not clear what a finite 
algebra should be, e.g. this is the case for infinite trees, and this paper sheds 
little new light on the question. Fix a setting, with the monad being T, for the 
following definitions. 
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A colouring of a T-algebra is defined to be a morphism from its universe 
to some object in the underlying category. For example, when the category is 
sets, then a coloring is like a multivalued language, i.e. instead of saying only 
“yes” or “no” to each input, a colouring can have multiple values. A coloring 
is said to be recognised by a T-morphism if it factors through it. A coloring is 
called T -recognisable if it is recognised by some T-morphism with a finite target, 
according to the notion of finite T-algebra given in the setting. 

Almost all examples in this paper are in sets, or in sorted sets. When the 
category is sets or sorted sets, we will focus mainly on the special case of colour¬ 
ings, namely languages, where colourings have two possible values on every sort. 
Consider a finite alphabet, according to the notion of finite alphabet given in 
the setting. In all of the examples of this paper where the category is sorted 
sets, a finite alphabet will be a possibly sorted set with finitely many elements. 
In particular, if there are infinitely many sorts, then a finite alphabet will use 
only finitely many. A T-language over a hnite alphabet E is defined to be any 
subset L C TS. Notions of recognisability are inherited from colourings, using 
the characteristic function of a language. Colourings are a mild generalisation of 
languages, for example, when the category is sets, then a colouring with finitely 
many colours is T-recognisable if and only if for every color, its inverse image is 
a recognisable language. 

When the monad T is clear from the context, we will sometimes skip the 
prefix T-, and simply write language, algebra, morphism, structure. 

Beyond recognisable languages. The recognisable languages will play the 
role of regular languages in the monad. One could go beyond regular languages. 
For instance, there is a natural monad version of context-free grammars, where 
the production rules have right hand sides in the monad applied to the terminals 
and nonterminals, and one can prove some theorems, like closure of context-free 
languages under intersection with recognisable languages. Context-free lan¬ 
guages are beyond the scope of this paper. 

3 Syntactic morphisms 

This section presents a monad generalisation of the Myhill-Nerode theorem, 
which gives a sufficient condition for colourings, and therefore also languages, 
to have a syntactic (i.e. minimal) morphism. The generalisation is proved only 
in the setting of sorted sets, and therefore also in the setting of normal set^ 
Fix a category of sorted sets, for some choice of, possibly infinitely many, sort 
names. A finite sorted set is one which has finitely many elements, in particular 
it can use only finitely many sorts. 

Finitary algebras. If T is a monad, then a T-algebra A is called finitary if 
for every ic S TA, there is some finite Aq C A such that w S TAq. Sometimes, 

^Bartek Klin has an alternative proof, which works in arbitrary categories, but requires 
some additional assumptions. 



a monad is such that every T-algebra is finitary, e.g. this is the case for the 
monad of finite words . 

Theorem 3.1 [Syntactic Morphism Theorem] Consider a monad T in a cate¬ 
gory of sorted sets. Let f be a colouring of an algebra A, which is recognised 
by a T-morphism h into some finitary T-algebra. There exists a surjective T- 
morphism into a T-algebra 


synt/ : A ^ Af, 

called the syntactic morphism of f, which recognises f and which factors through 
every surjective T-morphism recognising f. Furthermore, synt/ is unique up to 
isomorphisms on Af. 

Note that if A itself is finitary, then / is recognised by the identity T- 
morphism on A. Therefore, if a monad T is such that every T-algebra is finitary, 
then every colouring of a T-algebra has a syntactic morphism. This implies that 
every colouring has a syntactic morphism in monads such as the monad of finite 
words that corresponds to monoids, the monad of nonempty finite words that 
corresponds to semigroups, and several monads for describing finite trees that 
will be described later in the paper. Before proving the theorem, we give an 
example which shows how that a syntactic morphism might not exist in general. 

Running Example 2. Consider the monad of oo-words and the language 

L = {a^^ba’^^b- • • : the sequence is unbounded, i.e. limsuprii = oo.} 

We will prove that L does not have a syntactic morphism. Consider an equiva¬ 
lence relation ~ on natural numbers such that every equivalence class is finite. 
For example, ~ could identify all numbers that are between two consecutive 
powers of two. Define a function 

h^ : {a, 5}“ ^ N U (N^ x N/..) U {T, T} 

^ ■V' 

A 

as follows. If the input is infinite, then h..^ returns T or T depending on whether 
the input belongs to L. If the input has no 5’s, then hr^ returns the length. 
Finally, if the input contains at least one b, then h returns the triple consisting 
of: the number of a’s before the first b; the number of a’s after the last b; the 
equivalence class of the largest n such that the input has an infix ba^b (or the 
equivalence class of 0 if there is no such n). One can show that the kernel of h^^ 
is a congruence in the natural sense, and therefore A can be equipped with the 
structure of an oo-algebra which makes h an oo-morphism recognising L. 

Consider two equivalence relations ~i and ~2 on natural numbers, such 
that their transitive closure has infinite equivalence classes, e.g. identifies 
even numbers with their successors, while ~2 identifies even numbers with their 
predecessors. If there were a syntactic morphism h, then it would need to factor 
through both h.^.^ and and therefore it would need to assign the same 
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value to all words in ba*b. By associativity, h would assign the same value to all 
oo-words with infinitely many 6’s, and therefore it would not recognise L. □ 

The rest of Section|^is devoted to proving the Syntactic Morphism Theorem. 

3.1 Proof of the Syntactic Morphism Theorem 

We are working in a category of sorted sets; fix therefore a set of sort names, 
and a monad T. We first show that the syntactic morphism, if it exists, is 
unique up to ismorphisms on the target algebra. This is a consequence of the 
following lemma. In the lemma, the crucial distinction is between a function 
between universes of two T-algebras, and such a function which is a T-morphism, 
i.e. one that is consistent with the multiplication in the two algebras. 

Lemma 3.2 Let T he a monad, let A,B,C be T-algebras, let 

/ : A —> B and g : A —)■ C 

be T -morphisms, with f being surjective, and let h : B ^ C be a function such 
that, as functions on universes, the following diagram commutes. 

h 

'' 

c 



Then h is aT-morphism. 

Proof. 

This might be a standard lemma on Eilenberg-Moore algebras, although this 
proof uses right inverses, and it will therefore not work in every category. Con¬ 
sider the following diagram. 



The right triangular face (involving A, B, C) commutes by assumption of the 
lemma, and the left triangular face (involving TA, TB, TC) commutes by T 
applied to the assumption of the lemma. The two quadrangular faces commute 
because / and g are T-morphisms. Therefore, the entire diagram commutes. 
Because / is surjective, and we are in the category of sorted sets, / has a right 
inverse, i.e. a function f~^ : B —>■ A such that / o f~^ is the identity on B. By 
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the previous commuting diagram, the two paths from TA to C in the following 
diagram describe the same function. 

JB TA —^ TB B 

Th h 

Because T is a functor, it follows that the path connecting the two copies of TB 
in the above diagram is actually the identity on TB, and therefore the square 
face above commutes, which proves that h is a T-morphism.D 


Congruences. Define a congruence in an T-algebra A to be a surjective func¬ 
tion g : A ^ B from the universe of A to some set such that g o mulA factors 
through T g. 

Lemma 3.3 If A is a T-algebra and g : A ^ B is a congruence, then there is 
a multiplication operation on B which makes g into a T-morphism. 

Proof. 

The assumption that g is a congruence says that there is a function, call it mule, 
which makes the following diagram commute. 


TA—^TB 


hiuIa 

Y 


mulB 

y 


A- ^B 

g 

To prove the lemma, we need to show that mule is associative, which is explained 
in the following diagram. 



mulB 


The upper face commutes because T^ is a T-morphism between the free algebras 
TA and TB. The right and lower faces commute by the assumption on mule, 
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while the left face commutes by T applied to this assumption. It follows that 
all paths that begin in TT A and end in B denote the same function. Since g is 
surjective, we can use the same argument as in the end of Lemma |3.2| to show 
that the perimeter of the diagram commutes. □ 

Therefore, a congruence is simply a T-morphism with the algebraic structure 
on the target being ommitted. 

Polynomials. In universal algebra, a polynomial is a term with some con¬ 
stants from the algebra. We generalise this notion to monads. For a set X, 
define the set of polynomials over A with variables X to be 

pol^A = T(AUA). 

For a valuation v : X ^ A, we consider the evaluation function 

[-l(u) : polj^A —^ A 

which first replaces the variables in the argument polynomial by the valuation v, 
and then applies the multiplication in A. The notion of polynomials makes sense 
in arbitrary categories, not just those in sorted sets, but the following definition 
is specific to the category of sorted sets. Suppose that p is a polynomial over A 
with variables X. Define 


IpI : A^ a 

to be the function v >—)■ |p](u). A problem is that although |p] is a well-defined 
function, it is not a morphism in the category, because it is not necessarily sort 
preserving. For example, if X has just one variable x, then the function |p] is 
sort preserving only when the (output) sort of p is the same as the sort of the 
variable x. In the language of category theory, this problem is that monads in 
sorted sets need not be strong. The problem goes away when there is only one 
sort. 

If h : A ^ B is a T-morphism, and p G pol^A, then h{p) G poljfB is defined 
by applying h to the constants in p and leaving the variables alone. Formally 
speaking h(p) is obtained by applying Th' to p, where h' is the disjoint union 
of h and the identity on the variables. From the definition of T-morphism, it 
follows that the T-morphisms commute with polynomials in the sense that the 
following diagram commutes: 



I^(p)I 


( 1 ) 


Note that the diagram is not in the category of sorted sets, because the hori¬ 
zontal arrows are not necessarily sort preserving. 
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Unary polynomials. In our proof of the Syntactic Morphism Theorem, spe¬ 
cial attention is devoted to unary polynomials. In the setting of (unsorted) sets, 
which covers the well-known versions of the Syntactic Morphism Theorem for 
monoids or finite automata, the classical construction is to identify elements 
that cannot be distinguished by unary polynomials. To dehne unary polynomi¬ 
als in the setting of sorted sets, one needs a little care with the sorts. In the 
following, we assume that the name of each sort is also an element of its own 
sort. For sort names r and cr, a unary polynomial with input sort r and output 
sort a over A is defined to be a polynomial over A, which has sort u, and which 
uses just one variable, namely the sort name r. By abuse of notation, if r is a 
sort name then we write pol^A and |p](a), respectively, instead of the formally 
correct pol|.,-}A and |p](t >->• a). In the setting of (unsorted) sets, there is only 
one sort and unary polynomials can be composed forming a monoid. In the set¬ 
ting of sorted sets, to compose unary polynomials one needs to take care that 
the output sort of one unary polynomial matches the input sort of the other. 

Definition of the syntactic morphism. Consider a colouring 

/ : A ^ C, 

as in the assumptions of the Syntactic Morphism Theorem. Define an equiv¬ 
alence relation ~ on the universe A which identifies a,b € A if they have the 
same sort r and 

/(bl(a)) = fiMib)) for every p G pol^A. 

Dehne Ay to be the equivalence classes of and dehne the syntactic morphism 

synt/ : A —>■ Af 

to be the function which maps a to its equivalence class under ~. We will show 
that synt/ is a congruence, and therefore by Lemma |3.3| there is a multiplication 
operation on Ay image which makes synt/ into a surjective T-morphism. Let 
us begin by showing that ^ is a congruence with respect to polynomials with 
hnitely many variables, as expressed in the following lemma. 

Lemma 3.4 Let X be a finite set of variables, and let p G polj^-A. If vi,V 2 ■ 
X —> A are valuations then 

f\ Vi{x) ^ V 2 {x) implies bK^i) ^ b](^2)- 

Proof. 

The idea is that the dehnition of ^ guarantees the lemma for unary polynomials, 
and then induction extends the result to polynomials of higher hnite aritites. 

Consider hrst the case when X has exactly one variable, i.e. p is a unary 
polynomial. Let 01,02 be the values of the valuations vi,V 2 on the unique 
variable. We need to show that oi ^ 02 implies 

IpKoi) - [p]( 02 ). 
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Unraveling the definition of we need to show that 

(/ o M ° bl)(ai) = (/ o M ° M){a2) 

holds for every unary polynomial q whose input sort is the output sort of p. 
Composing the polynomials q and p yields a unary polynomial r such that that 
M ° M |r] describe the same function. By assumption that 01,02 are 
^-equivalent, they have the same values under / o |r], which proves the above 
equality, and completes the proof of the special case of the lemma when X has 
one variable. 

The case when X has more than one variable is proved by a straightforward 
induction on the size of X as follows. Let then , z ;2 : X —)■ A be as in the 
assumption of the lemma. Choose some parition X = Xi U X 2 with both Xi 
being nonempty, and define pi for i £ {1, 2} to be the polynomial obtained from 
p by substituting the variables from Xi with their values under Vi. Then 

bK^^l) = ~ bll(^'2UJ = lP2liv2\x^) ~ b2](w2|xi) = M(^^2)• 


□ 


Let US restate a special case of the above lemma in terms of commuting 
diagrams. 

Corollary 3.5 If X is a finite set then 



synt/ 


implies 


TX 

Tv2 

TA 

mulA 

A 






synt/ 


synt/ 


Proof. 

This is a special case of Lemma |3 . 4| where the polynomials have no constants in 
them, i.e. they are built entirely out of variables. □ 


Lemma 3.6 // ft, : A —>■ B js a T-morphism that recognises f, then synt/ 
factors through ft. 

Proof. 

The value of synt/ for an element of A is determined by the values of / o [pj 
on the element, ranging over all unary polynomials p of appropriate input sort. 
Therefore, to prove the lemma it suffices to show that / o |p] factors through 
ft for every unary polynomial p. This is the content of diagram Q and the 
assumption that ft recognises /. □ 
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We now resume the proof of the Syntactic Morphism theorem. Recall the 
assumption that the coloring / is recognised by a morphism 

h:A-^B 

into a finitary T-algebra. By Lemma [3.6[ the syntactic morphism factors through 
h, and therefore there is a function syntg/ which makes the following diagram 
commute. 


A— 



synte/ 

Y 




We will show in the following lemma that syntg/ is a congruence on B. The 
lemma will complete the proof of the Syntactic Morphism Theorem, because by 
Lemma 3.3 there is a multiplication operation on which makes it into an 
algebra A/ such that syntg/ is a T-morphism. Therefore, synt/ is a T-morphism 
from A to A/, as the composition of T-morphisms syntg/ and h. 


Lemma 3.7 syntg/ is a congruence in B. 


Proof. 

By the assumption that we are in a category of sorted sets, and the assumption 
that syntg/ is surjective, there is a right inverse 

synte/"^ : Af ^ B, 

i.e. a function such that syntg/ o syntg/”^ is the identity on Af. Define 

i = synte/”^ osynte/. 

Similarly, let h~^ : B ^ A he a. right inverse of h, i.e. a function such that 
ho h~^ is the identity on B. From the definitions of h~^ and i we see that the 
following diagram commutes. 


B (2) 



Later in the proof, we will use the above commuting diagram to show that that 
the assumptions of Corollary |3.5| are satisfied, when the mappings vi,V 2 from 
the Corollary are the identity and h~^ oioh, restricted to a finite subset of A. 
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We will prove that Ti does not affect the value under syntB/omulB, i-e. that 
the following diagram commutes 


mulB 



(3) 


Before proving that the diagram above commutes, we show how it implies the 
statement of the lemma. The statement is that syntg/ is a congruence, which 
means that iiw,w'€TB have the same image under Tsyntg/, then they have 
the same image under syntB/omulB- By definition i factors through syntg/, and 
therefore if w, w' have the same image under Tsyntg/, then they have the same 
image under Ti, and therefore they have the same image under syntg/ o mulB 
thanks to ([^. 

To prove we use the assumption that B is finitary, i.e. every structure in 
TB already belongs to TBq for some finite subset Bq C B. Therefore, to prove 
that the above diagram commutes, it suffices to prove that it commutes when 
the upper left TB is replaced by TBq for some finite Bq. Let then Bq be a finite 
subset of B. Define Aq to be the image of Bq under h~^, and define j : A ^ A 
to be the restriction oi h~^ o i o h to Aq. 


TBq 


Th- 


TAq 


Tj 


TA 




We claim that all paths that begin in the upper T Bq and end in >1/ denote the 
same function. Thanks to the functions j and the identity on Aq satisfy the 
assumptions in Corollary [STSI By the Corollary, the two paths on the permieter 
that begin in TBq and end in describe the same function. The two upper 
quadrangular face commutes by definition of j. The face which uses TB twice 
commutes because ho h~^ is the identity on B. The two faces which use mulA 
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commute because is a T-morphism. The two triangular faces commute by 
definition of syntg/. 

Since T/ioT/i“^ is the identity on TBq, we have proved that the diagram ^ 
commutes assuming that the top left corner is replaced by TBq; and therefore 
by the assumption on B being finitary we have proved that the diagram © 
commutes in general.□ 


4 Pseudovarieties 

This section is dedicated to a monad version of Eilenberg’s pseudovariety theo¬ 
rem. Eilenberg’s theorem says that, in the case of semigroups, language pseu¬ 
dovarieties and algebra pseudovarieties, which will be defined below, are in 
bijective correspondence. The theorem implies that if L is a language pseudova¬ 
riety, then the membership problem L S L can be decided only by looking at 
the syntactic semigroup of L, and one need not look at the accepting set, nor at 
the information about which letters are mapped to which elements of the semi¬ 
group. A typical application of the pseudovariety theorem is that definability 
in first-order logic, or various fragments thereof, can be determined based only 
on the syntactic monoid. The theorem does not give an algorithm to determine 
this, the algorithm needs to be found in a case-by-case way. 

In this section we prove that the pseudovariety theorem works in general 
for monads when the category is (possibly sorted) sets, with the same proof 
as in the case of monoids. Surely Eilenberg must have known this, since he 
invented both the pseudovariety theorem and algebras in abstract monads, but 
I have not found this result in his book [Eil74) . Our generalised pseudovariety 
theorem subsumes pseudovariety theorems for: finite words in both monoid and 
semigroup variants [Eil74j . c»-words |Wil91| . scattered linear orderings |BR12] . 
finite trees [Ste92| ; it also gives pseudovariety theorems for other known settings 
which have not had their pseudovariety theorems yet, such as forest algebra. 

Algebra pseudovarieties. The definition of an algebra pseudovariety is a 
straightforward generalisation of the definition given by Eilenberg for semi¬ 
groups or monoids. It is a class of finite algebras, according to the notion of 
finiteness given in the setting, which is closed under products, morphic images 
and subalgebras, as defined below in more detail. 

• Products. A class of T-algebras is called closed under products if when¬ 
ever A.B are in the class, then so is A x B. 

• Morphic images. A class of T-algebras is called closed under morphic 
images if whenever h : A —>■ B is a surjective T-morphism and A is in the 
class, then so is B. 

• Subalgebras. A class of T-algebras is called closed under subalgebras if 
whenever A is in the class, then every subalgebra of A is in the class. A 
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subalgebra of A is obtained by restricting the universe to a subset B such 
that mulA maps elements of TB to B. 

• Algebra pseudovariety. A class of finite T -algebras is called an algebra 
pseudovariety if it has all three closure properties defined above. 

Running Example 3. Call an oo-algebra A definite if the multiplication 
operation 

mulA : A°° A 

is such that the value of the multiplication depends only on the first n letters of 
the argument, for some n depending only on the algebra. Definite oo-algebras 
are easily seen to form an algebra pseudovariety. □ 

Language pseudovarieties. Unlike for algebras, the notion of language pseu¬ 
dovariety requires some discussion. For intuition, let us recall the original no¬ 
tion of language pseudovariety for semigroups that was introduced by Eilenberg. 
Eilenberg defines a language pseudovariety for semigroups to be a class of recog¬ 
nisable languages of finite words which is closed under Boolean combinations, 
inverse images of semigroup morphisms h : E’*' —)■ F"'", and derivatives. Here a 
derivative of a language L C is defined to be any language of the form 

w Lv = {u G h : wuv G L\. 

for some w,v G E*. 

It is not immediately obvious how to generalise the notion of derivative 
to abstract monads. We propose two solutions: one using unary polynomials, 
which we call a polynomial derivative, and one using syntactic morphisms, which 
we call a syntactic derivative. The advantage of polynomial derivatives is that 
they are closer to the derivatives used by Eilenberg, while the advantage of 
syntactic derivatives is that they make sense in settings without a clear notion 
of unary polynomials (recall that unary polynomials have only been defined for 
sorted sets). The two notions of derivative lead to different notions of language 
pseudovariety, which happen to coincide in settings that use sorted sets. The 
precise definitions are given below. 

• Boolean combinations. A class of T-languages is called closed under 
Boolean eombinations, if whenever it contains languages L C TE and 
K C TF, then it also contains 

Lf^K L\JK TE-T 

Of course, in the presence of complementation, only one of U, 0 is needed. 

• Morphic preimages. A class of T-languages is called closed under mor- 
phic preimages if whenever the class contains a language L C TE and 
h : TF -G TE is a T-morphism with F being a finite alphabet, then the 
class also contains h~^{L). 
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• Polynomial derivatives. (This definition assumes that the setting has 
a notion of unary polynomial, which have only been defined for sorted sets 
in this paper.) A class of T-languages is called closed under polynomial 
derivatives if whenever the class contains a language L C TS and p is a 
unary polynomial in the T-algebra TE, then the class also contains the 
language 

p-^L = {u; G TS : |p1(w) G L}. 

• Syntactic derivatives. A class of T-languages is called closed under 
polynomial derivatives if whenever the class contains a language, then it 
also contains all other languages recognised by its syntactic algebra. 

• Polynomial language pseudovariety. A polynomial language pseu¬ 
dovariety is a class of recognisable T-languages that is closed under Boolean 
combinations, morphic preimages, and polynomial derivatives. 

• Syntactic language pseudovariety. A polynomial language pseudova¬ 
riety is a class of recognisable T-languages that is closed under Boolean 
combinations, morphic preimages, and syntactic derivatives. (Since com¬ 
plementation is covered by syntactic derivatives, it suffices to have only 
closure under union and not all Boolean combinations.) 

As usual for pseudovarieties, a T-language in the above definitions is formally 
treated as its characteristic function, which means that a language comes with a 
description of its domain. The reason for this is that it is sometimes important 
to know the input alphabet of a language. Before continuing, let us observe the 
following simple fact. 

Fact 4.1 Polynomial derivatives are a special case of syntactic derivatives. 

Proof. 

Thanks to 0, any T-morphism, not necessarily the syntactic morphism, which 
recognises L will also recognise every polynomial derivative p~^L. □ 

Running Example 4. Call an oo-language definite if there is some n G N such 
that membership in the language depends only on the first n letters. Examples of 
definite oo-languages include: “words that begin with a”, or “words of length at 
least two”. Clearly definite oo-languages are closed under Boolean combinations. 
They are also closed under inverse images of oo-morphisms, because if 

h : E°° ^ 

is an oo-morphism, then the first n letters of h{w) are uniquely determined by 
the first n letters (or less) of w. Here it is important that the monad of oo-words 
does not allow the empty word; there is a natural variant of the monad which 
does have the empty word, and in this variant the definite oo-languages do not 
form a pseudovariety. 
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The same argument as for oo-morphisms applies to functions —>■ 

defined by unary polynomials. Therefore, definite oo-languages are closed un¬ 
der polynomial derivatives as well. Summing up, definite oo-languages form a 
polynomial language pseudovariety. Definite oo-languages also form a syntactic 
language pseudovariety, but this takes a little more effort to check, and will 
follow from Corollary |4.5[ □ 

4.1 The Syntactic Pseudovariety Theorem. 

We have defined two versions of language pseudovarieties, syntactic and poly¬ 
nomial, and therefore there will be two version of Pseudovariety Theorem. In 
this section we present the version which talks about syntactic varieties. The 
proof is essentially a monad version of half of Eilenberg’s proof, because the 
definition of syntactic derivative eliminates the other half. A closer similarity 
with Eilenberg’s full theorem is the polynomial version, which is presented in 
the next section, but which comes at the cost of restricting to settings that use 
sorted sets. 

Eor a class L of recognisable T-languages, define AlgL to be the class of finite 
T-algebras which only recognise T-languages from L. Eor a class A of finite T- 
algebras, define Lan A to be the T-languages recognised by T-algebras from A. 
The Pseudovariety Theorem says that these mappings are mutual bijections 
when restricted to pseudovarieties. 

Theorem 4.2 [Syntactic Pseudovariety Theorem] Consider a setting with the 
following properties. 

• Every recognisable language has a syntactic morphism; 

• Every finite algebra is finitely generated, i.e. its universe has a finite subset 
G such that multiplication is surjective when restricted to TG; 

• Every finite algebra A has a finite subset of its universe Aq with the follow¬ 
ing property. // h : A —>■ B is a surjective T -morphism which is injective 
when restricted to Aq, then h is an isomorphism. 

Then the mapping Lan is a bijection between algebra pseudovarieties and syn¬ 
tactic language pseudovarieties, and its inverse is Alg . 

The rest of Section |4.1| is devoted to proving the Syntactic Pseudovariety 
Theorem. We begin by showing that Alg and Lan produce pseudovarieties 
when given pseudovarieties (of appropriate types, respectively); actually not all 
closure properties are needed for this part. If L is a syntactic language pseu¬ 
dovariety, then AlgL is easily seen to be an algebra pseudovariety. Actually, to 
prove this, we only need to assumption that L is closed under Boolean combi¬ 
nations. This is because every T-language recognised by A x B is a Boolean 
combination of T-languages recognised by A and B. If A is any class of finite 
T-algebras, in particular an algebra pseudovariety, then Lan A is easily to be a 
syntactic language pseudovariety. 
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To finish the proof of the Syntactic Pseudovariety Theorem, it remains to 
show that if L and A are pseudovarieties of T-languages and T-algebras respec¬ 
tively, then 


AlgLanA = A and LanAlgL = L. 

By definition, the class Lan AlgL consists of T-languages that are recognised by 
some finite T-algebra which only recognises T-languages from L. Therefore 

Lan AlgL C L. 

For the converse inclusion, consider a language L € L. By assumption on the 
setting, L has a syntactic algebra, and by definition of language pseudovarieties, 
every language recognised by this syntactic algebra belongs to L. Therefore, L 
is recognised by some algebra which only recognises languages from L. Here 
we have profited from the definition of syntactic derivatives; with polynomial 
derivatives this part of the proof will need to be more involved. 

More effort is required for the equality 

Alg Lan A = A. 

By definition, the class Alg Lan A consists of hnite T-algebras A such that every 
finitely sorted T-language recognised by A is recognised by some T-algebra from 
A. This gives the right-to-left inclusion. The converse inclusion is proved in the 
following lemma. 

Lemma 4.3 Let A be a finite T-algebra sueh that every T-language recognised 
by A is recognised by some T-algebra from A. Then A G A. 

Proof. 

Let G be a finite generating subset of the universe of A, i.e. a subset such that 
mulA is surjective when restricted to TG. By the assumptions on the setting, 
there is a finite subset Aq of the universe of A such that if a surjective T- 
morphism / : A —> B is injective on Ag then it is an isomorphism. For a G Aq 
define 


La = {w G TG : mulA(ic) = a}. 

Let the syntactic morphism of La be 

ha-.TG^ Ba. 

The syntactic morphism exists because La is recognised by a hnite algebra, 
namely A, and therefore the Syntactic Morphism Theorem can be applied. 
Furthermore, by the assumption of the lemma. La, like any language recognised 
by A, is also recognised by some algebra from A. Therefore, the syntactic 
algebra is an image of some algebra in A, and therefore itself belongs to A 
by closure of algebra pseudovarieties under morphic images. Using the dehnition 
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of syntactic morphism again, the syntactic morphism ha must factor through 
mulA- Summing up, G A and there is a surjective morphism fa which makes 
the following diagram commute. 



Define h to be the product of the morphisms ha ranging over a G Ag, which is 
surjective onto its image 

h:TG^BC Ba. 

(iGAq 

The algebra B belongs to A, by closure of A under finite products and subalge¬ 
bras. Defining / to be the product of all fa, we see that the following diagram 
commutes. 



B 


To prove that / is actually an isomoprhism, it suffices to show that / is inective 
when restricted to Aq. Because G are generators, every element a G Ag is the 
image under mulA of some iCq G TG. Furthermore, if a 7 ^ 6 , then h(wa) h(wi,), 
because only one of Wa,Wb belongs to the language La that is recognised by h. 
□ 

4.2 The Polynomial Pseudovariety Theorem 

In this section, we prove that if the setting uses sorted sets with finitely many 
sorts, then both the syntactic and polynomial versions of language pseudovariety 
coincide. The key result is Lemma |4.4| below, which says that syntactic deriva¬ 
tives can be represented as inverse morphic images of Boolean combinations of 
polynomial derivatives. This lemma is essentially the other half of Eilenberg’s 
proof, which was not used in the syntactic version of the pseudovariety theorem. 
We state Lemma |4.4| in the more general setting with possibly infinitely many 
sorts. In such a setting, call a language finitely sorted if on all but finitely many 
sorts it is full or empty. When there are finitely many sorts, then all languages 
are finitely sorted. 

Lemma 4.4 Consider a setting where the category is sorted sets, with possibly 
infinitely many sorts, and that the notion of finite algebra is such that finiteness 
of an algebra implies that the universe is finite on every sort. Then for every 
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recognisable T-language L, every finitely sorted syntactic derivative of L is an 
inverse image, under some T -morphism, of a Boolean combination of polynomial 
derivatives of L. 

Before proving the above lemma, let us note two corollaries, and an example 
of a setting where the conclusion of the lemma is violated. 

Corollary 4.5 Consider a setting where the category is sorted sets with finitely 
many sorts, finite alphabets are finite sorted sets, and the notion of finite algebra 
is such that finiteness of an algebra implies that the universe is finite. Then 
syntactic language pseudovarieties are the same thing as polynomial language 
pseudovarieties. 


Proof. 

Lemma [4.4| implies that every polynomial language pseudovariety is closed under 
syntactic derivatives, and is therefore a syntactic language pseudovariety. The 
converse is Fact |4.1[ which says that closure under syntactic derivatives implies 
closure under polynomial derivatives. □ 

Actually, the above corollary would also be true with infinitely many sorts, 
with a modified definition of language pseudovariety where only finitely sorted 
languages are allowed. By combining the above corollary with the Syntactic 
Pseudovariety Theorem, we get the Polynomial Pseudovariety Theorem stated 
below. 


Corollary 4.6 [Polynomial Pseudovariety Theorem] Under assumptions on the 
setting as in Corollary \f.t^ Lan is a bijection between T-algebra pseudovarieties 
and polynomial T -language pseudovarieties, and its inverse is Alg . 

An advantage of the polynomial version of the pseudovariety theorem is that 
it is sometimes easier to check if a class is closed under polynomial derivatives, 
as compared to syntactic derivatives. This was the case for definite oo-languages 
discussed previously in the running example. 


Running Example 5. As an illustration of the Polynomial Pseudovariety 
Theorem, it is easy to see that Alg takes the class of definite oo-languages to 
the class of definite oo-algebras, and the mapping Lan goes the other way. An 
oo-language is definite if and only if its syntactic oo-algebra is definite. □ 
Here is an example of a setting which violates the conclusions of the Polyno¬ 
mial Pseudovariety Theorem, and therefore also the conclusions of Lemma [4^ 


Example 2. In the proof of the Lemma 4.4 we will use the following property 
of the category of sorted sets: if g : X —)■ F is surjective, then there is an inverse 
-.Y —)■ X such that g o g~^ is the identity on Y. An example of a category 
where this assumption fails is nominal sets. In nominal sets, the Polynomial 
Pseudovariety Theorem also fails, as we show in this example. The example 
assumes familiarity with nominal sets, and orbit-finite sets. 
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Consider the category where objects are finitely supported nominal sets and 
morphisms are finitely supported functions. Consider the monad of finite words 
in this category, where algebras are finitely supported semigroups. To complete 
the definition of the setting, define finite alphabets to be finitely supported 
sets which are orbit-finite, and define finite algebras to be finitely supported 
orbit-finite semigroups. This setting was studied in |Bojl3| , although not using 
the monad terminology. The Syntactic Morphism Theorem holds in this set¬ 
ting, as was shown in Lemmas 3.3 and 3.4 of |Bojl3| . We will show that the 
Pseudovariety Theorem fails in this setting. 

Here is the property of finitely supported functions that will make the Poly¬ 
nomial Pseudovariety Theorem fail. Let A denote the atoms underlying the 
nominal sets, let P 2 A be size two sets of atoms, i.e. unordered pairs of atoms. 
One can show that if 


/ : P 2 A ^ A+ 

is a function supported by a finite set S of atoms, 

/({a, b}) = /({c, 4) for a, b,c,d^ S. (4) 

Define L to be the polynomial language pseudovariety generated by all recog¬ 
nisable languages over the alphabet A. It is not difficult to see that a language 
L C E+ belongs to L if and only if there is a finitely supported monoid morphism 

: E+ ^ A+ 

such that L is an inverse image under h of some recongisable subset of A+. We 
will show that Alg L is not an algebra pseudovariety, because it is not closed 
under morphic images. 

Consider the following two languages. 

1. The alphabet is ordered pairs of atoms, i.e. A^. The language consists of 
two letter words over this alphabet such that the two atoms which appear 
in the first letter are pairwise distinct from the two atoms that appear in 
the second letter. In other words, this language is 

Li = {(a, 5)(c, d) : {a, b} n {c, d} = 0} C (A^)+ 

This language is recognised by a semigroup, call it Si, whose universe is 

A'u{T,T}, 

with elements of A^ describing one letter words, with T describing words 
in the language, and with T describing words of length at least two that 
are outside the language. Although Si recognises the language Li, it is 
not its syntactic semigroup. To get the syntactic semigroup, one needs to 
identify ordered pairs that correspond to the same set, i.e. the syntactic 
semigroup, call it S 2 , has universe 

P2 AU{T,T}. 
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Clearly S 2 is an image of Si under a finitely supported semigroup mor¬ 
phism, namely the function which forgets the order in pairs. Therefore, 
any algebra pseudovariety with Si will also contain S 2 . 

2. Here is a language that is recognised by S 2 . The alphabet is unordered 
pairs of atoms, i.e. P 2 A. The language consists of two letter words over 
this alphabet such that the set in the first letter is disjoint with the set in 
the second letter. In other words, this language is 

L 2 = {{a, b}{c, d} : {a, b} n {c, d} = 0} C (P 2 A)''‘ 


We claim that AlgL contains Si but not S 2 , and is therefore not an algebra 
pseudovariety. It is not difficult to show that Si recognises only languages from 
L, and therefore it belongs to Alg L. We only show that L 2 is not in L, and 
therefore S 2 is not in Alg L. To this end, we need to show that there is no 
finitely supported semigroup morphism 

h : (P2A)+ ^ A+ 


such that L 2 is an inverse image of some recognisable subset of A"*". Indeed, 
by the function h would need to assign the same value to two different 
letters in P 2 A, and therefore it could not recognise ^ 2 - D 


The rest of this section is devoted to proving Lemma 4.4 


Lemma 4.7 Assume the assumptions of Lemma \4-4\ Let L be a recognisable 
T -language. Every finitely sorted language recognised by the syntactic morphism 
of L is is a Boolean combination of polynomial derivatives of L. 


Proof. 

Let L C TE be a recognisable T-language. Let K he a finitely sorted language 
recognised by the syntactic morphism of L, in particular the K is also a subset 
of TE. We want to show that AT is a Boolean combination of derivatives of 
L. A finitely sorted language is a finite union of single-sorted languages, and 
therefore without loss of generality, we can assume that K entirely included in 
a single sort, call it r. 

Claim 4.7.1 There is a finite set P C pol.,.TE such that structures w,w' € TE 
of sort T have the same image under the syntactic morphism of L if and only if 

p{w) G L iff p{w') G L for every p G P. (5) 

Proof. 

By construction of the syntactic morphism in the proof of the Syntactic Mor¬ 
phism Theorem, structures w, w' have the same image under the syntactic mor¬ 
phism if and only if ([^ holds for every polynomial p G pol^TE, not necessarily 
from some finite set P. In other words, one can choose for every w, w' a poly¬ 
nomial Pw,w' such that w and w' have the same image under the syntactic 
morphism if and only if 

IPw.w'liw) G L iff |p«,,u,'l(w') G A. 
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Furthermore, because the syntactic morphism recognises L, the choice of Pw,w' 
need need only depend on the images of w and w' under the syntactic morphism, 
for which there are finitely many possibilities. □ 

Stated differently, the claim says that structures in sort r have the same 
image under the syntactic morphism if and only if they belong to the same 
polynomial derivatives p~^L for p belonging to the hnite set P in the statement 
of the claim. This means that K, being a subset of sort t that is recognised by 
the syntactic morphism, is a Boolean combination of finitely many derivatives. 
□ 

Lemma 4.8 Let T be a set and let f : A ^ IS and h : TE —)■ B &e T -morphisms. 
If f is surjective, then there is some T-morphism g which makes the following 
diagram commute 


A 



/ 


B 


Proof. 

Because / is surjective, there is a function g' which makes the following diagram 
commute. 



Consider the following diagram. 


TA 



By definition of g', the lower face commutes. The upper left face commutes 
because multiplication in an algebra must maps units to themselves, while the 
upper right face comes from the assumption that the unit in a monad is a natural 
transformation. All arrows on the perimeter of the diamond-shaped diagram 
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describe T-morphisms. Therefore, both paths which begin with the edge 77 s and 
end in B describe the same function. Because TS is generated by the units of 
S, and both paths from TE to B are (compositions of) T-morphisms, it follows 
that both paths from TS to B describe the same T-morphism. Therefore, g in 
the statement of the lemma can be taken to be mulA oTg'. □ 


Proof, (of Lemma 4.4) 

The lemma says that if L C TT is a recognisable T-language, then every finitely 
sorted syntactic derivative of L is an inverse image, under some T-morphism, 
of a Boolean combination of polynomial derivatives of L. Let the syntactic 
morphism of L be 


/ : TT B. 

Suppose that K C TE is a finitely sorted syntactic derivative of L, i.e. it is 
recognised by some T-morphism 

/i : TE ^ B. 

By Lemma |4.8[ there is a T-morphism g which makes the following diagram 
commute. 





f 

I 

B 


In other words, K is an inverse image, under g, of some language M recognised 
by the syntactic morphism /. We can assume without loss of generality that M 
is empty (respectively, full) on sorts where K is empty (respectively, full), and 
therefore M is also finitely sorted. By Lemma 4.7 M is a Boolean combination 
of polynomial derivatives of L. □ 

This completes the proof of the Polynomial Pseudovariety Theorem. 


5 Representing an algebra 

In all interesting cases, the monad T produces infinite sets, even on finite argu¬ 
ments. Therefore, the finiteness of the universe of a T-algebra A does not, on 
its own, imply that the algebra itself has a finite representation, because one 
needs some way of representing the algebra’s multiplication operation 

mulA : TA —>■ A. 

In this section, we present one such way. We assume that the monad is in the 
category of sets, or sorted sets. The idea is to find a function Tq, which chooses 
for every finite set A a finite subset TqA C TA such that: 
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1. for every finite T-algebra A with universe A, the multiplication operation 
is uniquely determined by its values on TqA; 

2 . the function A TqA can be computed, modulo some representation of 
elements in TqA C TA. 

For instance, in the monad of finite words, the function Tq maps a set A to 
word over A of length two, because a semigroup is uniquely determined by its 
neutral element and its binary multiplication. In the example of oo-algebras, 
the function To maps A to words over A of length two and to infinite words of 
the form for some a G A. We now describe these notions in more detail. 

Subfunctors. Because the monad is in the category of sets, or sorted sets, 
the notion of subset can be used. Define a subfunctor of a monad T to be a 
mapping which takes every set A to a subset TqX C TX. A subfunctor on 
its own is not a monad (as defined here it is not even a functor), however it 
can be used to generate a monad as follows. For an ordinal number a, define 
Tq A C TA as follows by transfinite induction: TgA is the units of A, while for 
a > 0 we have 

T“A U mulxxToT^A. 

P<a 

By monotonicity, this sequence must stabilise at some value, which is denoted 
by TqA. If the monad is finitary, i.e. every element w gTX belongs to w G TY 
for some finite Y C X, then the sequence stabilises at uj, i.e. induction only on 
natural numbers is needed. It is not difficult to show that Tg is a submonad of 
T, i.e. a subfunctor with the monad structure inherited from T. A subfunctor 
Tg is said to span an algebra A if 

mulATgA = mulATA 

holds for every subset A of the universe. A subfunctor is called complete if it 
spans every T-algebra, and finitely complete if it spans every finite T-algebra; 
note how this depends on the notion of finite T-algebra. 

Running Example 6. Consider the monad oo for infinite words. Define 

TgA = {xy,x‘^ : x,y G X}. 

It is not difficult to check that the submonad Tg maps A to the finite and 
ultimately periodic words over alphabet A. Using the Ramsey Theorem, in the 
same way as it is used explicitly by Wilke in |Wil91| . and implicitly by Biichi 
in |Buc62| . we show that Tg is finitely complete. Indeed, let A be a subset of 
the universe in some finite cxD-algebra A. To show that Tg spans A, we need 
to show that if ic G A°°, then there is some ultimately periodic word v over A 
such that 


mulA(rc) = mulA(r')- 
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If w is finite, then it already is ultimately periodic. Otherwise, using the Ramsey 
Theorem, one can decompose w as 


w = W 0 W 1 W 2 ■ ■ ■ with wq, Wi, ... G 


such that mulA gives the same result for all the finite words Wi, W 2 , ■ ■ ■■ Let 
be the image of Wi under mulA- By assumption that all are the same for 
i > 1 and by associativity, we have 

mulA(rc) = mulA(aoaiai ■ ■ ■) = mulA(wo(wi)‘^), 

and the latter uses an ultimately periodic word. As we shall see, the argument 
made in this example is the only part of the proof of decidability of MSO on 00- 
words that needs to be proved by hand; the remainder of the proof will follow 
from abstract principles stated in Theorem |6.3[ □ 

Reducts. Consider a subfunctor Tq that is finitely complete for a monad T. 
For a finite T-algebra A, define its Tg-reduct to be the pair consisting of the 
universe A of A, and the restriction of the multiplication operation from A to 
the subfunctor: 


hiuIaItoA : TqA a 

The Tp-reduct is a special case of what category theorists call an algebra over 
signature T 0 . Straight from the definition it follows that if Tg spans A, then 
A is uniquely determined by its Tg-reduct. In particular, if Tg is complete, 
then every algebra over signature Tg extends to at most one T-algebra. Note 
the “at most one” in the previous sentence; some algebras over signature Tg 
might not extend to T-algebras, e.g. not every binary operation extends to a 
semigroup operation, because for this associativity is needed. The same holds 
for finite completeness and finite algebras. The point of using Tg-reducts is 
that sometimes Tg can be chosen so that it preserves finiteness, and therefore 
Tg-reducts can be manipulated by algorithms, at least as long as finite objects 
and functions between them can be manipulated by algorithms. 

Running Example 7. The T g-reduct of a finite 00 -algebra consists of a finite 
universe A together with two operations, of aritites two and one: 

_-_:AxA^A 

This is essentially the same thing as a Wilke semigroup. Not every choice of 
finite universe and three operations above will yield an Tg-representation of 
some finite 00 -algebra; this requires the operations to satisfy certain axioms, 
e.g. Wilke gives such axioms in Definition 3 of |Wil93j . □ 
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Computing the syntactic T-morphism. The point of T o-reducts is to have 
a finite representation of T-algebras so that they can be manipulated by algo¬ 
rithms. We give one example of such an algorithm, namely the Moor^ algo¬ 
rithm. This algorithm computes the syntactic morphisms in polynomial time. 
To state this result, we need to explain how morphisms are represented. Con¬ 
sider a subfunctor Tq. We assume that it is effective, in the sense that TqS 
can be computed up to isomorphism based on E for finite E, in particular Tq 
preserves finiteness. The Tg-representation of a finite T-algebra is simply the 
finite multiplication table that gives the values of mulA for arguments from 
TqA. The Tp-representation of a T-morphism h : TE —>■ A consists of the 
To-representation of the algebra, as well as the values of h for units. If Tg has 
polynomial size increase, as is the case in the examples of monoids or oo-algebras 
discussed in Examples and then the Tg-representation of an algebra will be 
of size polynomial with respect to the size of the universe. However, there will 
be examples where Tg has exponential size increase, e.g. in Sectionj^in the case 
of countable chains. 

Lemma 5.1 Let T be a monad in a category of sorted sets, with finitely many 
sorts, and let Tq be a subfunctor that is complete for finite algebras. Then syn¬ 
tactic T -morphisms can be computed for T -recognisable languages, in polynomial 
time with respect to To-representation. 

Proof. 

Using the Moore algorithm. □ 


6 Monadic second-order logic 

An important part of the theory of regular languages is the connection between 
recognisability and definability in monadic second-order logic MSO. This con¬ 
nection says that languages recognised by finite recognisers are the same thing 
as MSO definable languages. Examples where this connection holds include: fi¬ 
nite words (as proved independently by Bfichi, Elgot and Trakhtenbrot), infinite 
words (as proved by Bfichi), finite trees (as proved by Thatcher and Wright), 
infinite trees (as proved by Rabin), etc. There are common ingredients in all 
of the proofs, and there are parts that are specific to each domain. In this 
section, we show that the common ingredients can be stated and proved on the 
abstract level of monads. This takes care of much of the symbol pushing in 
the proofs, and leaves only the combinatorial parts to be proved in each spe¬ 
cific case, e.g. nothing is left to be proved for finite words or trees, or only the 
Ramsey theorem needs to be applied in the case of oo-words. 

6.1 Language theoretic definition of MSO 

To establish the connection between MSO and recognisability, consider the fol¬ 
lowing lemma, see |Tho96) . which characterises MSO in a way that does not talk 

^This is not the same Moore as in Eilenberg-Moore algebras. 
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about “positions” or “sets of positions” of a structure, but is defined in purely 
language theoretic terms. 

Lemma 6.1 A language L CT,* is definable in MSO if and only if it belongs to 
the least class of languages that is closed under Boolean combinations, images 
and inverse images of morphisms /i : E* —>■ F*, and which contains the languages 

0*C{0,1}* and 0*1* C {0,1}*. 

A similar lemma holds for infinite words (instead of 0*1* one uses 0*1°°), 
and also for finite and infinite trees, etc. Motivated by the above, we define 
an abstract notion of MSO in a monad T. In the abstract version, predicates 
are modelled by languages. For a set £ of T-languages, dehne MSOt(>C) to be 
the smallest class of T-languages which contains £, is closed under Booolean 
operations, images and inverse images of T-morphisms. 

The following lemma is in the category of sets, or more generally, in cate¬ 
gories which have a powerset functor that preserves finiteness. A non-example 
is the category of nominal sets with orbit-finite sets, where powerset does not 
preserve orbit-finiteness, and also MSO contains non-recognisable languages, 
see |Bojl3| . 

Lemma 6.2 IfC contains onlyT-recognisable T-languages, then so does msOt(£J. 

Proof. 

To prove the lemma, one needs to show that T-recognisable languages are 
closed under Boolean operations, images of T-morphisms, inverse images of 
T-morphisms. For Boolean operations we use products, for inverse images the 
property is immediate. The only nontrivial part is the images, where we use the 
powerset construction, defined as follows. We write PX for the powerset of X. 

If AT is a set, then we say that w &TX belongs pointwise to u G TPX if there 
is some element of 


T{{a £ X,b e PX) : a G 6 } 

which projects to w and v respectively on the first and second coordinates. For 
a T-algebra A, define its powerset to be the T-algebra 

PA ; TPA ^ PA 

whose multiplication operation maps w G TPA to the set 

{mulA(^^) : V gTA belongs pointwise to w}. 

It is not difficult to check that this is indeed a T-algebra, for the distrustful see 
Johnstone []. □ 
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6.2 Deciding satisfiability of MSO 

For a monad T, we define MSO satisfiability over T to be the following decision 
problem. An instance is what one can see as an MSO formula, which is formalised 
as an expression that uses the constructors of MSO formulas, with the predicates 
being represented by T-morphisms recognising them. The question is whether 
the language corresponding to the instance is nonempty. 

In this section we give a sufficient criterion for the decidability of MSO sat¬ 
isfiability. We assume that the monad is in the setting of finitely sorted sets. 

Strongly effective subfunctor. Recall the notion of an effective subfunctor 
To from Section which said that if S is finite then TqE is also finite and 
can be computed based on E. As discussed in Section if a monad T has an 
effective subfunctor Tg that is finitely complete, then a finite T-algebra can be 
represented by its multiplication table restricted to Tg, while a T-morphism 

h : TE ^ A 

where E is a finite alphabet and A is finite can be represented by its values on 
generators, i.e. units of E. For the results on MSO of this section, we will need 
a stronger assumption, which says that algebras recognising singleton sets can 
be computed. A subfunctor Tq is called strongly effective if for every finite set 
E and every w S TgE, one can compute a representation of a T-morphism 

h : TE ^ A 

into a finite T-algebra that recognises {w}. 

Example 3. Consider the monad oo of infinite words, and the subfunctor 
TgA = {e,xy,x‘^ : x,y G X}. 

which was considered in Examples and and proved to be finitely complete. 
We claim that Tg is strongly effective. Clearly Tg preserves finiteness and can 
be computed, as TgA is isomoprhic to 1 U U X. For a finite alphabet E and 
a, & € E it is not difficult to compute Tg-reducts of c»-algebras that recognise 
the languages {a6} and {a‘^}. Let us do the case of {a“}. The oo-algebra has 
four elements in its universe, representing the empty word, finite words in a'^, 
the unique infinite word a‘^, and finally words that use some letter other than 
a. □ 

The following theorem shows that a sufficient crieterion for decidable MSO 
satisfiability is having a subfunctor that is finitely complete and strongly effec¬ 
tive. 

Theorem 6.3 Let T be a monad in the setting of finitely sorted sets. If there 
is a subfunctor Tg that is strongly effective and finitely complete, then MSO 
satisfiability is decidable. 
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As mentioned at the beginning of this section, Theorem 6.3 is abstract non¬ 
sense in the sense that it does not resolve the actual combinatorics necessary 
to prove satisfiability of MSO. This can be seen in the series of Examples]^ 
and which show that the monad of infinite words has a subfunctor that is 
finitely complete and effective, and therefore Theorem |6.3| can be invoked to 
show that satisfiability of MSO is decidable over infinite words. The decidability 
proof that comes from these examples has the same structure as the original 
proof of Biichi |Biic62] , or its algebraic version in [Wil93| . What the examples 
show is that a large part of the proof is sufhciently generic to be stated on the 
abstract level of monads; and the only challenge is finding a subfunctor that is 
finitely complete and strongly effective, with finite completeness being essential 
part. 

Theorem 6.3 follows immediately from the following lemma. 


Lemma 6.4 From multiplication tables ofTo-reducts of a finite T-algebras 
one ean compute multiplication tables of the To-reducts of PA and A x B. 


Proof. 

The Cartesian product is immediate, the interesting case is the powerset PA. 
For w G To (PA), we need to compute mulpA(w). By strong effectivity of Tq, we 
can compute a T-morphism 


h : T(PA) -> B 

that recognises the singleton {w}. Define S to be the finite set of pairs (a, Aq) 
such that a G Aq C A and consider the T-morphism 

g : TE —)• A x B 

which works like mulA on the first coordinate, and like h on the second coordi¬ 
nate. By definition of the powerset algebra, 

mulpA(rc) = {a : some v G TE satisfies g(v) = (a,h(w))}. 

Therefore, to compute the above, it suffices to be able to compute the image 

g(TE) C A X B. 

Because Tq spans every finite T-algebra, the above image is the same thing as 
the smallest subset of A x B that contains images of single letters from E, and 
which is closed under g restricted to Tq. This subset can be computed. □ 
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Part II 


Example Monads 

In this part, we give examples of how monads can be used to describe algebraic 
approaches to the languages for labelled chains (Section]^, unary queries over 
finite words (Section]^ and various kinds of trees (Section]^. These examples 
illustrate the general theorems from the first part, i.e. the Syntactic Morphism 
Theorem, the Eilenberg Pseudovariety Theorem, and the results on MSO. 
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7 Monads for chains 


In this section, we show monads for representing chains, which are a generalisa¬ 
tion of infinite words, where the set of positions can be any total order, e.g. the 
rational or even real numbers. A chain over an alphabet S is defined to be a 
nonempty totally ordered set of positions, together with a labelling of these po¬ 
sitions by E. Chains form a monad, modulo the issue that all chains over a given 
alphabet do not form a set. The unit of this monad interprets an element a S E 
as a chain with a single position labelled by a. The multiplication of a chain of 
chains w is defined by taking positions to be pairs (i,j) such that j is a position 
in w, and j is a position in the label of position i, ordered lexicographically. 

Shelah showed in [She75| that it is undecidable if a sentence of MSO is true 
in ordered real numbers (K, <), which can be seen as an unlabelled chain, or 
equivalently, a chain over a one-letter alphabet. This implies that satisfiability 
of MSO is undecidable on arbitary chains, or even on chains of cardinality con¬ 
tinuum, i.e. one cannot decide, given an MSO formula with a binary predicate 
for the order, whether or not the formula is true in some chain. The binary 
predicate for the order can be seen as the language of chains over the alphabet 
{0,1} where all zeros are before all ones. It follows that the assumptions of 
Theorem 6.3 cannot be met, even for chains of cardinality at most continuum. 
These problems go away if one considers countable chains. 


Countable chains. A countable chain is one where the set of positions is 
countable. A countable chain is called scattered if its indexing set is scattered, 
i.e. its positions do not embed an isomorphic copy of the rational numbers. 
A special case of a scattered chain is a countable well-chain, i.e. one where 
the positions are well-ordered. These three kinds of chains are submonads of 
the monad of chains, i.e. they form monads when equipped with the unit and 
multiplication inherited from the monad of all chains. 

The following theorem shows that in all three cases, the algebras admit 
finitely complete subfunctors, as defined in Section which are also strongly 
effective as defined in Section |6l The cases of countable well-founded and count¬ 
able scattered chains are simple enough to warrant a self-contained proof, mod¬ 
ulo the Hausdorff theorem on scattered chains. The case of arbitrary countable 
chains is more involved and follows from [She75| . see also |CCP11) . 

Theorem 7.1 


1. Every finite algebra in the monad of countable well-chains is spanned by 

X ^ {x ■ y,x‘^ •. x,y & X} 

2. Every finite algebra in the monad of countable scattered chains is spanned 
by 

X 1 -^ {x ■ y, x'^ ,x~‘^ : x,y G X} 
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3. Every finite algebra in the monad of countable chains is spanned by 


X 1 -^ {x ■ y, x‘^,x shuffleF : x,y G X,Y C X} 

where shuffleF is the chain where the positions are rational numbers and 
where every y G Y labels a dense subset (such a chain is unique up to 
isomorphism). 

Proof, (of the first two cases) 

The Hausdorff theorem on scattered chains says that scattered chains are the 
smallest class of chains that contains the finite chains, chains indexed by ui and 
—w, and is closed under substitution. For well-founded countable chains, the 
same holds, but —oj is not allowed. The result then follows, using the Ramsey 
theorem in the same way as in the case of oo-algebras. □ 


Corollary 7.2 Satisfiability for MSO is decidable on: all countable chains, scat¬ 
tered chains, and well-ordered countable chains. 


Proof. 

It is easy to see that the subfunctors given in Theorem 7.1 are strongly effective. 
Therefore, the result follows from Theorem 16.31 □ 


In particular, for the well-chains and the scattered chains, we get a simple 
self-contained proof of decidability for MSO. This proof is no different from the 
known ones, but the advantage of using monads is that they clearly identify 
which part of the argument is specific to the monad being used. 


8 Pointed words 

This section presents a monad which generates a new kind of algebra, which, 
although simple, has not appeared in the literature up to the author’s best 
knowledge. The monad, call it Point, is defined by 

Points = A*AA*, 

where ^ is a disjoint copy of the set A. Elements of Points are called pointed 
The idea is that a pointed word represents a nonempty word over A 
where the underlined position is selected, and therefore a pointed word can be 
used as an input to a unary query that tests properties of positions in a word. 
Therefore we will use the term unary query for a set of pointed words. The unit 
operation is a i—>■ a, while the monad multiplication operation is the same as in 
the monad of finite words, except that the underlined position is the underlined 
position in the underlined word. 

A pointed word can be viewed in two ways: as a nonempty word over al¬ 
phabet E with a distinguished position, or as a special case of a non-pointed 

^Similar ideas would work for pointed chains, pointed trees, etc. 
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word over an extended alphabet SUE. In logical terms, the first view proposes 
that sets of pointed words are defined by unary queries (i.e. formulas with one 
free individual variable) over the alphabet E, and the second view proposes that 
sets of pointed words are defined by Boolean queries (i.e. with no free variables) 
over the extended alphabet E U S. For some logics, the two views are essentially 
the same. For instance a set of pointed words is MSO definable in the first view 
if and only if it is MSO-definable in the second view. The same is true for first- 
order logic with the order predicate. Therefore, for some logics such as MSO or 
first-order logic, characterising unary queries reduces to characterising Boolean 
queries. However, for some logics this is not the case. 

In Section |8.1[ we will show that for two-variable first-order logic, charac¬ 
terising unary queries is not easily reducible to characterising Boolean queries 
over extended alphabets. We also show how finite Point-algebras are useful in 
characterising unary queries. Along the way, we use much of the machinery 
developed in Part I of this paper, in particular the Syntactic Morphism theo¬ 
rem, the Pseudovariety Theorem, and the results on representation. All of these 
would be relatively straightforward to prove by hand in the special case of the 
monad Point, but deducing them from abstract nonsense allows us to focus on 
the more specific and combinatorial parts of the proof. 

Much of the material in Section |8.1| is specific to unary queries definable 
in two-variable first-order logic, and the reader who is more interested in the 
general principles of monads is advised to skip it. 

8.1 Unary queries definable in two-variable first-order logic 

To illustrate the monad of pointed words Point, consider the fragment of first- 
order logic that uses only two variables, but which is allowed to reuse them by 
requantifying. The logic has access to predicates for the labels and the order, 
but not for the successor, although similar results are true for other choices of 
predicates. We say that a set of pointed words, i.e. a unary query, is two-variable 
definable if it can be defined by a formula of two-variable first-order logic that 
has one free variable, say x, and which uses the predicates described above. 
In the semantics of the formula, the free variable binds the selected position, 
but once the free variable of the query is requantified, the selected position is 
forgotten. For example the unary query “the distinguished position is followed 
by at least two positions with label a” can be defined in two-variable first-order 
logic, although only thanks to using requantification: 

(f{x) = 3?/ (a; < y A a[y) A 3a: {y < x A a{x))). 

It is not immediately clear how to define the unary query “the successor of the 
distinguished position has label a”, because the natural formula would use three 
variables to define successor in terms of order: 

tpix) =3y {x < y A a{y) A Vz(z < xV y < z)). 

In fact, the unary query il^{x) cannot be defined using two variables, as long as 
the vocabulary has predicates just for the order and labels, which is our chosen 
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setting in this section. This example illustrates that with only two variables, the 
choice of vocabulary is more important than in first-order logic with arbitrarily 
many variables. 

The two-variable fragment of first-order logic is a well-studied logic for non- 
pointed words, i.e. for Boolean queries on words, see e.g. |TW98| . but it also 
makes sense for unary queries, as it corresponds to unary queries definable in 
XPath with only the transitive axis // and its invers^ 

We will show that two-variable definable languages form a pseudovariety, 
and therefore by the Pseudovariety Theorem, definability of a language in two- 
variable logic depends only the syntactic Point-algebra of the language. The 
Pseudvariety Theorem alone does not give an algorithm to decide this definabil¬ 
ity, but such an algorithm is given in Theorem |8.2| 


The transformation monoids. In every Point-algebra there is a hidden 
monoid, actually two monoids. Consider a Point-algebra A. For a G A, de¬ 
fine its left transformation to be the function A —>■ A defined by 

b I—mulA(a6)- 

Likewise we define the right transformation. Left transformations form a monoid, 
equipped with function composition, call it the left monoid. If A is finite then 
so is the left monoid. Likewise one can define right transformations and the 
right monoid. It is not difficult to see that a Point-algebra is uniquely specified 
by its universe A and the the left and right transformations for each a € A. In 
other words, using the terminology of Section the subfunctor 

A 1 —^ {ab, of) : a,b G A} 


is complete for all Point-algebras. It is also strongly effective as defined in Sec¬ 
tion It follows that a finite Point-algebra can be represented in space polyno¬ 
mial in the size of its universe; and that syntactic algebras can be computed in 
polynomial time (by Lemma 5.11. 

The following example shows that just looking at the left and right monoids 
of a unary query is not sufficient to decide if it is two-variable definable. Stated 
in the language of temporal logic, the example shows that two-variable logic 
does not have the separation property. 


Example 4. Let us revisit the successor query discussed at the beginning 
of this section. Let the alphabet be {a, 6}, and consider the unary query “the 
successor of the selected position has label a”, i.e. 

{waav :w,vG {a,b}*,a G {a, 5}} C Pointja, 6}. 

When seen as a language over an extended alphabet, the above is definable by 
a formula of two-variable logic without free variables. The formula says that 

^To be fair, the XPath motivation would be best justified by studying the tree variant of 
the logic. Preliminary research indicates that the results from this section can be generalised 
to trees. 


38 






there exists a position with label a, such that one can go one step to the left 
and find the underlined position, but one cannot go two steps to the left and 
find the underlined position. When seen as a unary query over the alphabet E, 
the above is not two-variable dehnable, which will follow from Theorem |8. 2 1 

Also, one can observe that just looking at the left and right monoids is 
not sufhcient to understand the query. In this case, the left monoid is trivial, 
i.e. contains only the identity transformation, while the right monoid is the 
syntactic monoid of the language “words beginning with a”. Both monoids 
have the property that they recognise only languages definable in two-variable 
first-order logic. □ 

The above example shows that characterising unary queries definable in two- 
variable logic does not simply reduce to characterising languages (i.e. Boolean 
queries) definable in two-variable logic over an extended alphabet. 

An Ehrenfeucht-PraTsse game. We now show that two-variable definable 
unary queries form a pseudovariety of Point-languages. Therefore, by the Pseu¬ 
dovariety Theorem, the syntactic Point-algebra of a unary query has sufhcient 
information (unlike the left and right monoids) to decide if the query is two- 
variable dehnable. 

We do this using Ehrenfeucht-Fraisse games in a standard way. Consider two 
pointed words wq, wi. For n S N, dehne the following game, which is played by 
players Spoiler and Duplicator. At the beginning of the game, the labels of the 
selected positions in the two pointed words are checked; if they are different then 
Spoiler wins immediately and the game is terminated. If the selected positions 
have the same labels, then n rounds of the game are played as follows. At the 
beginning of each round Spoiler chooses i € {0,1} and a direction, which is one 
of “left”, “stay” or “right”. Then Spoiler changes the selected position in the 
pointed word Wi according to the direction, i.e. if the direction is “left” then the 
selected position is moved somewhere to the left, if it is “stay” than it is not 
changed, and if it is “right” then it is moved to the right. Duplicator responds 
by choosing a choosing a new selected position in the other pointed word wi-i, 
according to the direction chosen by Spoiler, and such that the new selected 
positions have the same labels. If Duplicator cannot do this, then Spoiler wins 
immediately and the game is terminated. Otherwise, another round is played 
with the new selected positions; and if all n rounds are played without Spoiler 
winning, then Duplicator wins. 

We write wq wi if Duplicator has a winning strategy in the n-round 
game. It is not difficult to show that wq wi holds if and only if wo,wi 
satisfy the same unary queries of two-variable logic of quantifier depth n. The 
following lemma, which is proved by composing winning strategies for Duplicator 
in an obvious way, says that equivalence under is preserved under unary 
polynomials and Point-morphisms. 
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Lemma 8.1 If pointed words satisfy wq wi then 

IpK^^o) Ipl('*i'i) for every unary polynomial in PorntYj 

h(wo) h{wi) for every Po\v\t-morphism f : Points —?• PointF 

A corollary of the above lemma is that unary queries that are two-variable 
definable form a pseudovariety of Point-languages. Closure under Boolean com¬ 
binations is immediate, while for closures under derivatives and inverse images 
under Point-morphisms, one uses Lemma [STT] and the fact that a Point-language 
is two-variable definable if and only if it is a finite union of equivalence classes 
of for some n. 


An effective characterization As stated above, two-variable unary queries 
form a pseudovariety of Point-languages, and therefore the Pseudovariety The¬ 
orem can be invoked to show that whether or not a unary query is two-variable 
depends only on its syntactic Point-algebra. Is this dependency effective? In the 
case of Boolean queries, i.e. languages of non-pointed words, this problem was 
solved in |TW98| . where it was shown that a language L C E* is two-variable 
definable if and only if its syntactic monoid belongs to a class of monoids called 
DA. The class da is a pseudovariety of monoids that can be defined by two iden¬ 
tities, and therefore membership in it is decidable. In the following theorem, 
we extend the result of |TW98) from Boolean queries to unary queries, i.e. from 
non-pointed words to pointed words. 

Theorem 8.2 Let q C Points be a unary query recognisable by a finite Point- 
algebra. Then q is two-variable definable if and only if its syntactic Po\nt-algebra 
A satisfies: 

• the left and right monoids of A belong to DA; 

• for every w € A+ which all letters in a set B Q A, every b G B and every 
V € Point??, the following equality holds for all but finitely many n G N; 

mulA(tc"^r’rc") = mulA(rc"r’'ic") = mulA(rc”r'&w”). 


The rest of Section [ST] is devoted to proving the above theorem. We begin 
with a corollary of the theorem, which says that definability of unary queries 
in two-variable logic can be decided in polynomial time. When talking about 
polynomial time, we refer to representation of Point-algebras with respect to 


A I—)■ {ab, ab : a^b G A}. 


When a, b are in the universe of a Point-algebra A, we will treat ab as an element 
of A, although the more formally correct notation would be mulA(a&). 

Corollary 8.3 Whether or not a recognisable q C PointS is two-variable defin¬ 
able can be decided in polynomial time with respect to the recognising morphism. 
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Proof. 

By Lemma |5.1[ the syntactic morphism can be computed in polynomial time 
based on any recognising morphism into a finite algebra. Therefore, it suffices 
to show that the conditions in Theorem |8.2| can be checked in polynomial time, 
when given on input an Point-algebra A. 

For the first condition, one computes the left and right monoids. These 
monoids are quotients of A under an equivalence relation that can be checked 
in polynomial time, and therefore they can be computed in polynomial time. 
Then one checks in polynomial time if the left and right monoids satisfy the 
identities for DA. 

Let us show how to check the second condition. A naive algorithm would 
check all possible subsets B C A, which would take exponential time. To 
overcome this, define an ordering ^ on the universe of A, such that a A b holds 
if there exist pointed words w,v G Point A such that 

a = mulA(w) b = mulA(r') 

and every letter that appears in w also appears in v, ignoring the underlining. 
The relation A is not necessarily transitive, due to taking the image under mulA- 
In terms of the second condition in the statement of the theorem says that 
for all but finitely many n, 

mulA(a"&ca"') = mulA(a"ca"') = mulA(a"c5a") for every a>h,c 

It is not difficult to show that the above need only be checked for n which are 
linear in the size of left and right monoids of A, and therefore the only remaining 
thing to do is compute 

It is not difficult to show that ^ is the smallest relation which contains every 
pair a < a and which satisfies the following implications for every a, 6, c, d in A. 


a <b 
a A b 

a A b and c A d 
a A b and c A d 


implies 

a A be 

implies 

a A be 

implies 

ac A bd 

implies 

gc :< bd. 


In particular, ^ can be computed in polynomial time using a fixpoint algorithm. 

□ 

The rest of Section |8.1| is devoted to proving Theorem |8.2| We begin with 
the easier implication. 


Lemma 8.4 If a unary query is two-variable definable, then its syntactic alge¬ 


bra satisfies the conditions in Theorem 8.2 


Proof. 

Let q C Points be a unary query definable in two-variable first-order logic, 
and let n be the quantifier depth of the defining formula. By Lemma |8.11 the 
equivalence relation is preserved under unary polynomials, and therefore by 
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the Syntactic Morphism theorem it is a congruence, i.e. the set of equivalence 
classes can be equipped a multiplication operation which makes it into a finite 
Point-algebra, call it A„. Since q is a finite union of equivalence classes under 
it is recognised by A„, and therefore the syntactic algebra of q is an image 
of A„ under a Point-morphism. The conditions in Theorem |8.2| are easily seen 
to be closed under images of Point-morphisms, and therefore it suffices to show 
that these conditions are satisfied by A„. We only sketch the proof for the 
second condition: this boils down to showing that if w € S* is a word which 
uses all letters in a set S C S, then 

w’^vibv 2 'w"^ w'^bw'^ for every m>n, vi,V 2 € B* and b G B. 


This is proved by induction on n. Here is one of the cases that needs to be 
considered: if in the first round, Spoiler moves the selected position of the first 
pointed word to some position in vi, the Duplicator responds by moving the 
selected position in the second pointed word to a position in the last copy of w 
before b which has the same label, such a position exists by assumption on w 
using all letters from B. □ 


The rest of Section |8.1| is devoted to showing the converse implication in 
A possibly partial function / : PointS —>■ X with X finite is 


Theorem 8.2 


called two-variable definable if the inverse image of every a; S A is two-variable 
definable. We will prove that if A is a finite Point-algebra that satisfies the con¬ 
ditions in the theorem, then every Point-morphism h : Points -> A is definable 
in two-variable logic. The proof is by induction on the size of the alphabet E. 
We begin by introducing some auxiliary results. 


Filtering. For a unary query q C Points, define 

filterg : (S* U PointS) —)■ (S* U PointS) 

to be the function which inputs a pointed or non-pointed word, and only keeps 
positions that are selected by q. The output is a pointed word if the input was 
a pointed word and q selected the selected position, otherwise the the output is 
a non-pointed word. For example, if g C Pointja, b} is the set of pointed words 
where the distinguished position has label a, then 

(filterg)(a6a) = aa € S* (filter( 7 )(aaa) = aqa G PointS. 

The following simple fact is proved by relativising formulas in the obvious 
way. 

Fact 8.5 If f : PointS —> X and q C PointS are definable in two-variable logie, 
then so is the partial function f o filter^. (The function is partial because it is 
undefined when filterg removes the selected position.) 
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A monoid. Let monA be the product of the left and right transformation 
monoids of A. It is not difficult to see that the left and right transformations 
of h{w) for w G Points do not depend on the selected position. In other words, 
there is a function monh which makes the following diagram commute 

Points ^ S* (6) 

mon/i 

Y 

monA 

where deselect is the function that ignores the selected position and mon is the 
function that computes the left and right transformation. It is not difficult to 
show that mon/i is a monoid morphism. By the assumption of the theorem, the 
monoid monA is in DA, as a product of two monoids in DA. 



Known results about DA. We now recall some results about sets of words 
(not pointed words) definable in two-variable logic. Let M be a monoid. Define 
the right ideal generated by m S M to be the set 

{mn : n G M} C M. 

We say m, n S M are 7^-equivalent, denoted by m n, if they generate the 
same right ideals. There is a symmetric notion of £-equivalence that uses left 
ideals. The theorem below summarises some results from |TW98| . It only men¬ 
tions 7?.-classes, i.e. equivalence classes under 7?.-equivalence, but the symmetric 
results also hold for £-classes. 


Theorem 8.6 For a monoid morphism 5 : E* —> M into a monoid in DA; 

1. every language recognised by g is a two-variable definable Boolean query, 
i.e. it is definable by a formula of two-variable first-order logic without free 
variables; 

2. for every TZ-class i? 0 / M there is a two-variable definable unary query 
which selects a position if and only if the prefix up to and including that 
position is mapped by g to R. 

3. for every to S M the set {n : mn ^ 7 ^ to} is a submonoid of M. 

Apply the above theorem to the morphism mon/i defined in diagram (§. 
By the second item of the theorem and its symmetric variant for £-classes, for 
every 7?,-class R and every £-class L of the monoid monA, there are two-variable 
definable unary queries, call them qn and qL, such that 

uav G qn iff monh(ua) G R 

uav G qL iff mon/i(au) G L 


holds for every m, u S E* and a G Ft. 

The more difficult implication from Theorem 8.2 will follow from the follow¬ 
ing lemma. 
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Lemma 8.7 Let R he an TZ-class in the monoid monA, and let L be an C-class 
in the monoid monA. Then the partial function obtained from h by restricting 
its domain to qr H ql is two-variable definable. 

Before proving the lemma, observe that it implies that h is two-variable 
definable. This is because every pointed word belongs to qn Ci pl for some 
choice of R and L. 

Proof. 

Call an 7?,-class minimal if the corresponding right ideal is minimal with respect 
to inclusion. Likewise we define a minimal /1-class. Consider three cases: when 
R is not minimal, when L is not minimal, and when both R and L are minimal. 
The first two cases are not disjoint. 


The 7?.-class R is not minimal. We prove a stronger result, namely the 
restriction of h to qn is two-variable definable. Let us decompose qn into a 
disjoint union of two unary queries, both of which are two-variable definable: 

selects the leftmost position that satisfies qn^ and q'^ selects the remaining 
positions. 


• Let us first show that h is two-variable definable when its domain is re¬ 
stricted to g^. Define gieft be the unary query which selects positions that 
are strictly to the left of some position that satisfies g^. Likewise define 
bright to be the positions that are strictly to the right of some position 
that satisfies g^. Because g^ is two-variable definable, then the queries 
gieft and gright are also two-variable definable. Consider some w G q^. The 
word deselects underlying w splits into three consecutive intervals: 

1. first come the positions that satisfy giett; 

2. then comes the single position that satisfies g^; 

3. finally come the positions that satisfy gright- 

Because the intervals are consecutive, when restricted to arguments from 
g^, the function h factors through the following three functions. 


monh o filtergieft 
h o filterg^ 
mon/i o filtergright 


PointE —> monA 
PointE —> A 
PointE -G monA 


The first and third functions are two-variable definable by Theorem 8.6 
and Fact |8.5[ The middle function depends only on the label of the selected 
position, and is therefore also two-variable definable. Therefore, h is two- 
variable definable when restricted to arguments from g^. 

• Let us now show that h is two-variable definable when restricted to q'^. 
The proof is the same as above, the only difference is in the proof that 


h o filtergj!^ : PointE — >■ A 


(7) 
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is two-variable definable. To prove this, let m be an element of the 7?,-class 
R, and consider the set 


M„ 


'= {n : mn ^ 7 ^ to} 


which is a submonoid of monA by item 3 of Theorem 8.6 Because M, 


is a submonoid, it is not difficult to show that it does not depend on the 
choice of TO S i?, i.e. it only depends on the 7?,-class R. Furthermore, this 
submonoid cannot be all of monA, since otherwise R would be a minimal 
7?.-class. Therefore, there is some a S E such that mon/i(a) does not belong 
to the submonoid, which means that 


mmonh{a) m for every m G R 

It follows that positions selected by cannot have label a. Therefore 
the function in 0 is two-variable definable by induction assumption on a 
smaller alphabet. 


The £-class R is not minimal. This case is symmetric to the previous one. 

Both L and R are minimal. We are left with the case where L is a minimal 
£-class of monA, and i? is a minimal 7^-class of monA. We will prove that, 
when restricted to pointed words in g/j H the value of h depends only on the 
label of the selected position. In other words, we claim that if 


W — U^leftH^right ‘C — "Cleftil^right 

are pointed words in qRf] ql with the same label o G E of the distinguished 
position, then both have the same value under h. Let u G E* be a non-pointed 
word which uses all letters in E, whose 7?,-class is R and whose £-class is L. 
Such a word exists by minimality, for example, u can be obtained from either of 
the pointed words w or by replacing the selected position by some non-pointed 
word which uses all letters from E. Let n be a sufficiently large number. We 
will show that 


h{w) = h{u^au^) = h{v). 

By symmetry, we only prove the left equality above. Because mon/i(i(;ieft) is 
in the same 7^-class as mon/i(r(;ieftM"), we have 

mon/i(t(;ieftw"')TO = mon/i(i(;ieft) for some to G monA. 

Therefore, there must exist some ccieft G E* such that wieitu^xieft induces the 
same left transformation in A as wieft. Using a symmetric reasoning for C- 
classes, we obtain some alright G E* such that h{w) is equal to 


aCright) 
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The elements mon/i(wieftu”) and monft,(u") are in the 7^-class R, because both 
have prefixes in this 7^-class and the class is minimal; and they are both in the 
£-class L because both have suffixes in this £-class. (Here we use minimality.) 
In a monoid from DA, or more generally in an aperiodic monoid, and element 
is uniquely determined by its £-class and 7^-class. Therefore, wieftw" and m" 
induce the same left transformation in A. By this observation, and a symmetric 
one for ^-classes, it follows that h{w) is equal to 

h{u XleftUXrightU ). 

From the assumption on A in the theorem, the above is equal to 

By the same reasoning, h{v) is also equal to the above, which completes the 
proof of the lemma. □ 

9 Monads for trees 

In this section, we present a series of monads for modelling trees. 

9.1 Ranked trees over a fixed alphabet 

We begin with a monad that represents finite trees over a fixed ranked alphabet. 
Algebras in this monad will be deterministic bottom up tree automata over the 
ranked alphabet. 

Consider a ranked alphabet S, i.e. a finite set where each element has an 
associated rank, which is a natural number. A ranked tree over such an alphabet 
is a finite tree labelled by S, where a node has as many children as the rank of 
its alphabet, and these children are ordered. In other words, this is a ground 
term over S seen as a signature. We define a monad Ts, which is parametrised 
by S, and which will model ranked trees over S. Although the alphabet S is 
ranked, the monad Ts itself is in the category of sets, i.e. sets without any arity 
structure imposed. 

Define Ts to be the monad which maps a set T to the set of terms over the 
signature S extended by variables from F (i.e. trees where labels from F can 
occur in the leaves). The multiplication operation 

muljj^r : T^TsF —)■ T^F 

is term substitution, while the unit maps a variable a G F to the term that 
consists only of this variable. In the language of category theory, this is the 
monad generated by E interpreted as a polynomial functor. 

If F is a finite alphabet, then a T^-language over F is a set of trees over the 
ranked alphabet E, extended by rank zero symbols for letters from F. In the 
special case of F = 0, a Ts-language over the empty alphabet is a set of ranked 
trees over the alphabet E. 
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Example 5. In this example, we show that when E contains only letters of rank 
one, then the monad Ts can be seen as modelling deterministic word automata 
with input alphabet E. Consider a ranked set E, which has only letters of rank 
one. If Q is a set, then elements of T^Q are trees with unary branching where 
inner nodes are from E and the unique leaf is from Q. For example an element 
of f,}!?} can look like this: 



When written bottom-up, such a tree can be seen as a word in Q • E*. A 
finite algebra Eilenberg-Moore for this monad, i.e. a finite T^-algebra, consists 
of a universe, call it Q, and a multiplication operation, which can be seen as a 
function 


6:Q-E*^Q. 

The associativity of multiplication says that 

S{S{q ■ w) ■ v) = S{q ■ w ■ v), 

and therefore S is uniquely defined by its values on Q ■ Yi. Stated differently, 
a Ts algebra is the same thing as a deterministic finite word automaton with 
input alphabet E, without designated initial and accepting states. □ 

Connections with E-algebras. Recall that in universal algebra, a E-algebra 
consists of a universe A together, together with an operation / : A" —A for 
each / S E of rank n. To go from a T^-algebra A in the sense of Eilenberg- 
Moore to a E-algebra in the sense of universal algebra, one defines the universe 
to be A, and the operation corresponding to a n-ary letter / S E to be 

(ai,... ,a„) G A” mulA(/(ai,... ,a„)). 

In the terminology of Section this is the E-reduct of A, where we view E as 
the (finitely complete and effective) subfunctor 

EA = {/(oi,..., a„) : / is an n-ary symbol in E} C T^A 

Every E-algebra is obtained this way, and therefore the two notions are essen¬ 
tially the same. This sameness extends to morphism^ 

® Actually, this sameness works for a more general notion of ranked set used in category 
theory, i.e. when S is an arbitrary functor. This more general setting can be used to describe 
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Connection with tree automata. As shown in Example if the alpahbet 
E contains letters of rank at most one, then T^-algebras are essentially the same 
thing as deterministic word automata. For other alphabets, the correspondence 
is with deterministic bottom-up tree automata. For a algebra A, there is a 
unique T^-morphism 


h : Te0 ^ A. 

When interpreting an element of Ts0 as a tree over the ranked alphabet, the 
algebra A maps every tree to an element of its universe. When the algebra 
is finite, this is the same thing as a deterministic bottom-up tree automaton, 
with the only difference being that an automaton also has an accepting subset 
of states, which indicates when a tree belongs to the language. Therefore, T^- 
recognisable languages are the same thing as the classical notion of regular 
languages of finite trees over the ranked alphabet E. 

Example 6. Consider the following variant of first-order logic on trees over a 
ranked alphabet E. To a tree t S T^T, one assigns a logical structure, where 
the universe is the nodes of the tree, and there are the following predicates: a 
unary predicate that is true in nodes with label a, a binary predicate for the 
descendant relation, and binary predicates for the i-th child relation for every 
i. A subset of T^T is called dehnable in first-order logic if there is a formula 
of first-order logic that is true in the logical structures corresponding to trees 
in the subset, and false in logical structures corresponding to tree outside the 
subset. A well-known open problem stated in [Tho84| is: can one decide if a 
recognisable language of trees is definable in first-order logic? 

Here we show the, already known, result that tree languages definable in 
first-order logic form a language pseudovariety. Since the assumptions of the 
Pseudovariety Theorem apply to the monad T^, this will imply that first-order 
definability of a tree language depends only on its syntactic algebra (whether 
or not this dependence is computable is the open problem). 

Recall that a language pseudovariety is a class of languages that is closed 
under Boolean combinations, polynomial derivatives, and inverse images under 
morphisms. (The other conditions in the definition are vacouous when there is 
only one sort, as is the case here.) Boolean combinations are for free in first- 
order logic. Closure under polynomial derivatives is shown that same way as 
closure under inverse morphisms, so we only show the latter closure. We need 
to show that for every T^-morphism 

h : T^r —)• T^A, (8) 

unranked trees, when S is the functor 

or unranked trees without sibling order, when S is the functor which takes X to finite multisets 
over X. A problem with these more general settings is that their Eilenberg-Moore algebras 
model automata that are too strong, in the sense that the transition function need not be 
describable in a finite way. 
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inverse images under h of first-order languages are also first-order definable. One 
way of proving this statement is to show that h is a special case of a more general 
notion of copying first-order interpretation^ and first-order definable languages 
are closed under inverse images of such interpretations. Another way, which 
we use here, is to use Ehrenfeucht-Fraisse games. Let us write s t if player 
Duplicator has a winning strategy in the n-round Ehrenfeucht-Fraisse game for 
the logical structures corresponding to s and t. It is easy to see that closure 
under inverse morphisms of first-order definable languages is implied by the 
following observation: 

s t implies h{s) h{t) for every n G N. (9) 

The above observation is proved using a straightforward strategy copying argu¬ 
ment, which we describe in more detail below. The key to the strategy copying 
argument is that every node in an image tree h{t) is uniquely identified by two 
pieces of information, which we call the origin and offset^ whose definition is 
explained by example in Figure]^ 

To prove in the game corresponding to h{s) and hit), Duplicator pre¬ 
serves the following invariant. 

(*) Suppose that i rounds have been played so far, and that the nodes selected 
in those rounds were Xi,... ,Xi in the tree h{s) and yi,... ,yi in the tree 
h(t). Then the offsets, if defined, are the same for each Xj and and 
Duplicator has a winning strategy for the remaining rounds in the game 
for s and t, assuming that the selected nodes Xi,..., and yi,..., are 
replaced by their origins. 

Assuming that s t holds, it is not difficult to show that Duplicator can 
preserve the invariant for n rounds in the game between h(s) and h{t). Al¬ 
though simple, the strategy copying argument is a bit delicate - as we will see 
in Example closure under inverse morphisms will fail in a different monad for 
modelling trees, where morphisms can duplicate subtrees. □ 

Dependence on E. In the monad Ts, there is a different monad for every E. 
In the following two sections, we present two approaches where the monad is 
independent of the alphabet. The price we will pay is using categories of ranked 
sets. 

9.2 Clones 

In this section, we consider a monad which is used to describe clones. We begin 
by recalling the definition of a clone from universal algebra: a clone over a 
universe A is a set of functions of the form A” —>■ A, of possibly different arities 
n G N, which includes all projections, i.e. functions of the form (oi,..., a„) i—>■ a^, 
and which is closed under composition in the sense that if the clone contains 
an n-ary operation / and k-axy operations /i,..., then it also contains the 


49 





Figure 2: The origin and offset functions. In this example, S has one symbol a 
of rank two, and one symbol c or rank zero. The origin function is from nodes 
of h{t) to nodes of t. The offset function is defined on the “new” nodes in h{t), 
i.e. those nodes in h(t) that have labels in F, and which have darker colour in 
the picture. The offset function maps such a node to the corresponding node 
h{a), where cr G F is the label of the origin. 
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fc-ary operation 


a ^ I—>■ f{fi{a),...,fn{a))GA. 

The category of ranked sets. To model clones by a monad, we use a dif¬ 
ferent category than sets. The category is ranked sets, i.e. sorted sets where 
the sort names are natural numbers. Recall that the notions of language theory 
are parametrised by notions of finite object and finite algebra. We make the 
following design decisions for the clone monad: a finite ranked set is one with 
finitely many elements, in particular only finitely many ranks can be achieved 
in a finite ranked set. We come back to the notion of finite algebra later on. 

The clone monad. The clone monad maps a ranked set S to the ranked set 
cloE, where elements of rank n are terms over E that use n variables xi,..., 
(the sequence of variables xi,X2, - ■ ■ is chosen so that they are fresh with respect 
to E). The terms need not use all variables, and variables may appear with 
repetitions. The monad multiplication operation 

/xs : clocloE —>• cloE 

is substitution, as illustrated in Figure]^ 

Comparison with clones. We use the name clo-algebra for an Eilenberg- 
Moore algebra in the monad of clones. A clo-algebra is almost the same thing 
as a clone in the sense of universal algebra, with the following differences. 

• Clones are more general than clo-algebras in the sense that clones admit 
a distinction between the universe and the operations of rank zero (con¬ 
stants). In other words, it is not necessarily the case that every element of 
a clone’s universe is a constant. (If this is the case, then a clone is called 
a polynomial clone.) 

• Clones are less general than clo-algebras in the sense that in a clone, un¬ 
like in a clo-algebra, there is an extensionality property with respect to 
the universe: elements of the clone are uniquely determined by the trans¬ 
formations that they induce on the universe. This is similar to the finite 
observability condition used in the Pseudovariety Theorem from Section]^ 

Therefore, a polynomial clone is the same thing as a clo-algebra that is zero- 
extensional in the sense every element is determined by its transformation on 
rank zero elements. 

Finitary clones. There is no sense in considering clo-algebras that have a 
finite universe, because the requirement on projections means that the universe 
is nonempty on every rank. In clo-algebras, we call a clo-algebra finite if it 
has finitely many elements for every rank, and is finitely generated. The fi¬ 
nite generation axiom is natural in the context of recognising languages (every 
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t s do do S {t) G do E 

t has 4 nodes and rank 0 H nodes and rank 0. 

Figure 3: Example of multiplication in a do-algebra. The ranked alphabet E 
has elements a,g,f of arities 0,1,2 respectively. The left picture represents 
a tree t G do do E, where variable Xi is used twice in the label of the root, 
which is drawn using parallel edges. This double use results in duplication after 
multiplication is applied. The light grey dotted circles on the right are not part 
of mulcioE(t), they just highlight how is obtained from t. 


recognisable do-language over a finite alphabet is recognised by a finitely gen¬ 
erated do-algebra), but it is not superfluous - there exist clones over a three 
element universe that are not fintiely generated, as shown by Yanov and Much- 


nik in |YM59j . and this is even the case for polynomial clones ADH83 


Example 7. This is a non-example of a pseudovariety. Let us revisit first-order 
logic on trees as defined in Example A language of ranked trees can be seen 
as a special case of a do-language, which happens to contain only elements of 
rank zero. Such languages are not closed under inverse images of clo-morphisms, 
which is witnessed by the following example, essentially due to Potthoff |Pot95] . 
(Recall that first-order definable language were closed under inverse morphisms 
for the monad of ranked trees, as shown in Example What worked in Ex¬ 
ample and no longer works in this example is that for clo-morphisms, a node 
is not uniquely determined by its offset and origin.) Consider letters 00 , 01,02 
with ranks 0,1, 2 respectively, and consider the do-morphism 


h : clo{oo,ai} —>■ clo{ao,a 2 } 
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which maps oq to oq, and which maps oi to the term 02 (xi, xi). This morphism 
sends trees that look like words to complete binary trees, as shown below: 



There is a first-order formula Lp that is true in complete binary trees of even 
depth, and false in complete binary trees of odd depth. The formula says that 
if one follows the unique path that begins in the root, and then turns left, right, 
left, right, etc., then one ends up in a leaf that is a left child. The inverse image, 
under the clo-morphism h, of the language dehned by ip is the set of trees over 
alphabet {oq, oi} which have even depth. This inverse image is not definable in 
first-order logic, and therefore first-order definable tree languages are not closed 
under inverse images of clo-morphisms. 

In particular, first-order logic does not form a pseudovariety of clo-languages. 
Therefore clones, or at least syntac tic clone s, are not the right tool to study first- 


order logic on trees. As shown in [EWOSj . his problem can be solved by using 


preclones, which are a variant of nonduplicating clones where every variable is 
used only once. The inadequacy of clones in this context is a bit of a shame, 
because clones have a better developed theory than preclones, e.g. Rosenberg 
classifies clones with a minimal set of operations that contains something other 
than projections [Ros86| or clones with a maximal set of operations that does not 
contain all operations |Ros70j . while Hobby and McKenzie classify congruences 
in a finite clone [HM88) . □ 


9.3 Forests of unranked trees 

We present a monad for modelling trees, which corresponds to forest alge¬ 
bra |BW08] . As in the previous two monads, the trees are finite (hnitely many 
nodes) and labelled (each node comes with a label). Unlike for the two previous 
monads trees are unranked, i.e. the number of children of a node is not deter¬ 
mined by its label, and can be arbitrarily large. We also assume that trees are 
sibling-ordered, i.e. the children of a node come with a total order. Finally, in¬ 
stead of trees it will be more convenient to talk about forests, which we define to 
be ordered sequences of trees, i.e. ordered sequences of trees that are unranked, 
labelled and sibling-ordered. Here is a picture of a forest: 

( 10 ) 
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Forests and contexts. The monad in this section will correspond to forest 
algebra. The principal idea behind forest algebra is to use two kinds of objects, 
namely forests and contexts. Forests have already been described above. A 
context is defined to be a forest with exactly one distinguished leaf, which is 
called the port of the contexl|^ Here is a picture of a context, with the port 
being labelled by xi'. 



The idea behind the port is that it can be replaced with a forest (or another 
context). One needs to be careful with the notion of replacement, because a 
port is a single node, while the forest that will replace it might have multiple 
roots, e.g. the forest in (10). The result of the replacement is that the all the 
roots of the inserted forest become children of the parent of the port, e.g. the 
result of replacing the port of (11) by the forest (10) is illustrated below, with 
the grey background indicating what used to be the port: 



The forest monad. We now define the monad for forests and contexts, which 
is called the forest monad. Line in all previously considered monads, the main 
idea is that one can replace any node with another element of the monad. In the 
forest monad, we will use the following discipline: leaves in a forest or context 
are be replaced by forests, while non-leaves are be replaced by contexts. This 
leads to a two-sorted alphabet: there are forest labels, which are found in leaves, 
and there are context labels, which are found on non-leaves. 

More formally, the forest monad, denoted by F, is in the category of two- 
sorted sets, where the sort names are “forest” and “context”. When applied to 
a sorted set S, the forest monad F yields the following sorted set FE 

• on the forest sort, FE contains nonempty forests labelled by E such that 
leaves are labelled by letters of “forest” sort, while non-leaves are labelled 
by letters of “context” sort; 

• on the context sort, FE contains contexts labelled by E in the same way 
as in the previous item. 

® One could consider a variant of this monad without the requirement that the port appears 
in exactly one leaf, we keep this requirement so that the monad ends up describing forest 
algebra introduced in IBW08I . Furthermore, allowing ports in many leaves would break the 
argument in Example!^ actually first-order logic would no longer be a pseudovariety. 
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The unit operation in the monad F maps a forest element a to unit forest that 
looks like this 



and maps a context element a to a unit context that looks like this 



The multiplication operation in the monad is based on the intuitions of replace¬ 


Figure|^ 


ment depicted in pictures (10 1 , (111 and (12). The operation is illustrated in 




mulFi;(t) S FE 


Figure 4: Example of multiplication in the forest monad. Before multiplication, 
t has two context nodes Vi and and two forest nodes V 2 and v^. After 
multiplication, t has fourteen nodes, which correspond to the non-variable nodes 
in the labels oi vi,... ,V 4 . Note how the x in the label of node vi is replaced by 
three nodes, namely the two roots of V 2 and the one root of U 3 , resulting in a 
change of the number of children for node v. 


A finite alphabet is a two-sorted set finite that is finite on both sorts, and a 
finite f-algebra is one whose universe is finite. 
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Lemma 9.1 Every F-algebra is spanned by the subfunctor Fq which maps E to 


0 @ 


where a,b are context elements ofH, and c,d are forest element ofT,. 

Proof. 

The lemma boils down to the following easy fact. Every forest or context can be 
built out of units forests and unit contexts by the following operations: replacing 
the port of a context by another context or a forest, and concatenating two 
forests. □ 

By the above lemma, every F-algebra is uniquely determined by its Fg-reduct. 
This reduct is exactly the same thing as a forest algebra from [BW08| . in the 
variant of forest algebra where there is no empty forest or context. One advan¬ 
tage of seeing forest algebras as a special case of monads is the we can apply the 
general theorems from the first part to see that forest algebra has a syntactic 
morphism theorem (already known) or a pseudovariety theorem (not present in 
the literature). 

Example 8. Let us revisit first-order login on trees, as considered in Examples]^ 
and[^ To a forest one can assign a logical structure, where the universe is the 
nodes of the forest, there are unary predicates for the labels, and two binary 
predicates for the descendant and document orders (document order is the order 
in which nodes are visited in depth first search, which takes into account the 
order on siblings). Eor a context, the structure is defined the same way, except 
there is constant which denotes the port. A language L C FS is called first- 
order definable if it can be dehned by a formula of first-order logic in terms 
of the logical structure defined above. One can show that first-order definable 
language form a pseudovariety. The interesting case is to show that if 

/i : FE ^ FT 



is a F-morphism, then 

t '^n s implies h{t) h{s) for every n G N, 

where says that the corresponding logical structures have the same first-order 
theory of rank n. This is proved using the same origin and offset argument as 
in Example □ 
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9.4 A monad for infinite unranked forests. 

The w-forest monad, denoted by X i—)■ ojfX, is defined like the monad F, with 
the difference that infinite forests and infinite contexts are also allowed (assume 
finite branching, though). The problem with this monad is that it is unclear 
what finite algebra should be in this case. Clearly, the algebra needs to be finite 
on both sorts, but this is not sufficient, as the following example shows. 

Example 9. Consider an alphabet S in the sense of the monad wF, i.e. an 
alphabet with elements of sorts “forest” and “context”. Let L be an arbitrary 
set of trees over the alphabet L, not necessarily MSO dehnable. Define denseL 
to be those forests where every node has a some descendant with a subtree in 
L. We claim that denseL is recognised by an wF-algebra A with a four element 
universe. There forest sort has elements “forests in L” and “forests not in L”. 
The context sort has elements: “every node outside the port path has some 
descendant with a subtree in L” and “some node outside the port path has 
no descendants with a subtree in L”; where the port path is defined to be the 
ancestors of the port. The dependence on L in the algebra A is seen in the 
multiplication operation 

mulA : ujfA —>■ A 

which maps infinite objects to elements of the universe A. In particular, there 
are uncountably many wF-algebras with finite universes, and there is no hope 
of representing them in a finite way. □ 

As witnessed by the above example, the notion of hnite algebra should have 
some additional requirements. Let us make the design decision that languages 
recognised by finite algebras should be exactly those that can be defined in 
MSO. The question of finding an adequate notion of finite wF-algebra is a monad 
formulation of an open problem in the community of algebraic language theory, 
namely the problem of a finding an algebraic model for MSO on infinite tees. 
The fact that we use monads, or that the trees are unranked, does not seem to 
be important. 

A simpleminded solution is to define an wF-algebra A to be finite if its 
universe is finite on both sorts, and the multiplication operation is MSO definable, 
in the sense that every language 

mul^^(a) C wVA with a G A 

is MSO definable. Adjusting for a different terminology, this is the solution pro¬ 
posed in |BI09] , where it is shown that syntactic algebras can be computed, one 
can check if an algebra satisfies given equalities, and the algebras can be used 
to decide questions such as “is a given language of infinite trees definable in 
the temporal logic EF?”. This definition of finite algebra is compatible with the 
results from Part I, in particular with the Syntactic Morphism Theorem and the 
Pseudovariety Theorem. Examples of language classes that are pseudovarieties 
include: languages defined in weak MSO, i.e. only using existential quantih- 
cation over finite sets; languages recognised by nondeterministic (respectively, 
alternating) tree automata that use parity ranks from a given subset D C N. 
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10 Future work 


This section sketches some potential monads to study in the future, with reasons 
for studying them. 

• Unranked trees with possibly infintie branching (or graphs, which should 
not make a difference) modulo bisimulation. The hope would be that 
recognisable languages, under a suitably chosen notion of finite algebra, 
would be the same thing as definable in /r-calculus. 

• Edge labelled hypergraphs. This looks like a monad, because a hypergraph 
with n distinguished port vertices can be substituted for a hyperedge of 
rank n, in the same spirit as Figure The hope would be to describe tree 
width or clique width as submonads generated by finite subfunctors (as 
defined in Section]^. 

• Typed terms of A-calculus with fixpoints, modulo equivalence. The hope 
would be to describe the work of Salvati and Walukiewicz. 

• Relations on words with origin information, as a generalisation of trans¬ 
ducers with origin information from |Bojl4| . The hope would be to give an 
algebraic framework for asynchronous relations on words with origin. The 
origin information would cure problems like no syntactic object, or un¬ 
decidability of universality, which plague asynchronous relations without 
origin. 
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Part III 

Profinite Monads 


In this part, we show that for every monad T, at least in the category of sets, 
there is a profinite version T. This gives immediately definitions, and basic 
theorems about, things like profinite words, profinite countable chains, profinite 
trees, etc. We also study the special case of profinite words, and show how the 
generic notion of recognisable language instantiates to an interesting class of 
languages of profinite words. 
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11 Stone duals and topology on an algebra 

Profinite constructions are common in mathematics. For recognisable languages, 
the best known prohnite construction is the semigroup of profinite words. In 
this section, we show how profinite are defined on the abstract level of monads. 
The main results of this section are: 

• Lattices of languages are exactly those families of languages that can be 
defined by profinite implications. This generalises to monads a result 
that was proved for semigroups in |GGP08| . As a corollary, we get a 
monad generalisation of the Reiterman theorem [Rei82] . which says that 
pseudovarieties are exactly those families of languages that can be defined 
by profinite identitites, which are a stronger form of profinite implications. 

• Every class of languages, e.g. context-free or decidable, can be used to 
yield get some kind of profinite object, but only recognisable languages 
can be used if we want algebraic operations to be uniformly continuous. 
These results are monad generalisations of results that were proved for 
semigroups in [GGPlOj . 

11.1 Stone duals of Boolean algebras 

In this section, we recall the definition of the Stone dual of a Boolean algebra, 
and how Stone duals can be used to characterise lattices. Section 111.11 does not 
talk about monads. 

Consider a Boolean algebra 


(A,n,u,^. 

Define an ultrafilter in A to be a proper subset U C A which is closed under 
intersections, and which contains every element of A or its complement but not 
both. The Stone dual of A, denoted by StoneA, is defined to be the following 
topological space. The points in the space are ultrafilters in the Boolean algebra. 
The topology is generated by base open sets which are of the form 

a = {[/:[/ is an ultrafilter containing a} for a G A. 

The topology on the Stone dual is known to compact and Hausdorff. One of 
the advantages of the Stone dual is that it can be used to describe lattices of 
languages, including the special case of pseudovarieties of languages. 

Profinite implications. We begin by repeating a result from [GGP08] . which 
says that for an arbitrary Boolean algebra, lattices are exactly those sets which 
are defined by profinite implications. (This terminology is different than |GGP08j , 
which uses the name “equation” for what we call an implication.) 

Consider a Boolean algebra A. (In the context of this paper, it is convenient 
to think of A as being all recognisable subsets of TE. In this case, elements of 
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the Boolean algebra are themselves sets.) A profinite implication over a Boolean 
algebra A is an expression of the form w ^ v, where w,v € StoneA. The arrow 
is just part of the syntax, so formally a profinite implication is simply a pair of 
elements from StoneA. An element a G A (e.g. a recognisable language, when 
the Boolean algebra consists of recognisable languages) is said to satisfy the 
implication ii a G w implies a G v. A subset B <G A (e.g. a family of recognisable 
languages when the Boolean algebra consists of recognisable languages) is said 
to be defined by a set of profinite implications if it contains exactly the elements 
that satisfy all profinite implications from the set. In the following lemma, B 
is called a lattice if it contains the 0 and 1 in a Boolean algebra, and is closed 
under finite unions and finite intersections. The following theorem can be found 
implicitly in |GGP08) . and maybe earlier as well. 

Theorem 11.1 Let A be a Boolean algebra and let B Q A. Then B is a lattice 
if and only if it is definable by profinite implications. 

Proof. 

It is easy to see that if a subset of A is definable by profinite implications, then 
it is a lattice. To prove the other implication, consider a lattice B C A. We 
claim that B is defined by the set of profinite implications: 

{a; —)■ y : for every a G B, A a G x then a G y}. (13) 


By definition, every element of B satisfies the profinite implications above. Let 
then a S A be such that a satisfies all the profinite implications above. To prove 
the theorem, we will to show a G B. 

Recall the definition of d in the definition of the Stone dual, which is that d 
is the set defined by o S a; iff cc S d. We first claim that every x G d satisfies 

X Gb C d for some b G B (14) 

For X G d, define [a:] C StoneA to be the intersection 

n 

b&Bnx 


It is easy to see that [a:] is the set of all y € A such that the profinite implication 
X ^ y belongs to the set (131. By assumption that a satisfies all these profinite 
implications, it follows that [a;] C d. Note that [a;] is an intersection of sets that 
are closed. By compactness of the Stone dual, [a:] is equal to an intersection 
of finitely many b with x G b G B. Furthermore, the intersection is nonempty, 
because B contains the greatest element of the Boolean algebra, being a lattice. 
Because B is closed under finite intersections, it follows that [a;] = b for some 
b G B with X Gb. Together with [a:] C d, this proves (14|. 

From (14) it follows that d is the union of all b ranging over b G B such 
that b C d. By compactness, the union can be made finite, and by closure of B 
under finite union, it follows that there is some b G B such that d = h. Finally, 
since a, b are in the Boolean algebra, it follows that a = 6. □ 
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11.2 Stone duals of T-algebras 

Section [ll.l| did not use monads and recognisability. In this section, we consider 
the special case of Stone duals of recognisable languages in a T-algebra. Using 
this Stone dual, we prove a monad version of the Reiterman theorem, which 
says that a class of recognisable languages is a language pseudovariety if and 
only if it can be defined by profinite identities. 

The Stone dual of a T-algebra. Fix a monad T in the category of sets. We 
assume that finite alphabets are finite sets, and finite algebras are algebras with 
finite universes. The results can be easily generalised to sorted sets. Let A be a 
T-algebra, not necessarily finite. Define recA to be the subsets of the universe 
of A that are recognised by T-morphisms from A into finite T-algebras. Since 
recA is a Boolean algebra, it has a Stone dual, which we denote by StoneA. 

Example 10. Consider the monad of finite nonempty words, where -|—algebras 
are semigroups, and -I—morphisms are semigroup morphisms. Consider the semi¬ 
group where E is a a finite alphabet. An element of StoneE+ is an ultrafilter 
in the Boolean algebra of recognisable languages over E. Recalling the definition 
of an ultrafilter, an element of StoneE+ is a family of recognisable languages 
over E, which is closed under intersection, and which contains every recognisable 
language or its complement. 

A simple example of such an ultrafilter is one that is induced by a word w S 
S+, namely the ultrafilter of recognisable languages which contain w. Stated 
differently, StoneE+ can be seen as a generalisation of E+. 

Here is a more exciting ultrafilter, which corresponds to taking the idem- 
potent power of a finite word. Recall the well known fact that in every finite 
semigroup A of size n, the function a i—>■ a"' maps every element of A to an 
idempotent, i.e. 


and this element is the unique idempotent power of a, i.e. if 

implies = a^'. 

We write for this idempotent power. A common notation would be w‘^, 
but choose # to avoid conflict with the co power in infinite words. For every 
semigroup morphism h : E’*' —> A, and every w € E"*", we have 

h{w^') = h{w)'^ for all but finitely many n. 

This implies that for every recognisable language L C E+ and every word w G 
E"*", either L contains w^' for all but finitely many L; or L does not contain w"' 
for all but finite many L. This in turn implies that the set of languages 

{L C E+ : L is recognisable and ic"' S L for all but finitely many n} 
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is an ultrafilter, which we denote by . □ 


Running Example 8. Consider the monad of oo-words used in the running 
example. As for semigroups, for every (X)-algebra A and every a S A there is 
a unique ide mpotent power . Let w G S'*" is a finite nonempty word. As in 
Example 10 one can also dehne profinite oo-word namely the ultrafilter 

{L C : L is recognisable and w^' G L for all but finitely many n}. (15) 


Note that in the monad for oo-words, the notation w‘^ stands for an actual 
infinite word, which can then be treated as a profinite word, i.e. 


{L C 


: w 


GL} 


(16) 


□ 

Theorem [TTt] can be applied to StoneA; for instance if the monad is the 
monad of finite words, and A is E+, then Theorem 11.1 says that a family of 
recognisable languages over E is a lattice if and only if it is definable by a set 
of prohnite identities. 


Running Example 9. Consider a language L C E°°. Define the first differ¬ 
ence distance between two oo-words to be zero if they are equal, and otherwise 
to be 1/n where n is the first position where the words have a different label. 
Define a safety language to be a set of oo-words which is closed under limits 
with respect to first difference distance. In other words, safety says that if w 
is an infinite word such that every finite prefix of w can be extended to some 
word from the language, then w itself belongs to the language. 

It is easy to see that recognisable safety oo-languages form a lattice, and 
therefore by Theorem |11.1| they must be characterised by a set of profinite 
implications. One can show that the set of profinite implications is 

vw'^u -G vw'^, (17) 


where v,w,u G E+ and the powers ^ and are understood as profinite words 
in the same sense as in (15) and (16), i.e. the two sides of the above profinite 
implication are the following ultrafilters, respectively. 


{L C E°° : L is recognisable and vw'^'u G L for all but finitely many n} 
{L C E~ : vw‘^ G L} 


Indeed, suppose that L is a safety language, and it satisfies the left side of 
the profinite implication for some v,'w,u, which means that it contains vw'^'u 
for almost all n. By safety, the language L must also contain the limit of the 
sequence vw'^'u, which is , and therefore L satisfies the right side of the 
profinite implication. The more interesting case is the converse, i.e. showing 
that if L satisfies all profinite implications of the form 0, then it is a safety 
language. To prove that L is a safety language, assume that it contains all words 


Wl, W2, ■ ■ • 
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which tend, under first difference distance, to some word w. We need to show 
that L also contains w. If w is finite, then all but finitely many of the words Wi 
are equal to w, and therefore w € L. Assume therefore that w is infinite. By 
the Ramsey Theorem, w can be factorised as 

W = V0V1V2 ■ ■ ■ 

such that all word vi,V 2 , ■ ■ ■ have the same image under some oo-morphism 

h : ^ A 

which recognises L. This means that for every i, all but finitely many of the 
words Wj have a prefix of the form VqVi ■ ■ - Vi. Without loss of generality, we 
may assume that 


Wi = VqVi 


■ ViUi, 


and also without loss of generality we may assume that all words Ui have the 
same image under the oo-morphism h. Since L is recognised by h, it follows that 
L contains all words of the form uo(ui)”ui. By (17), L also contains Vo(vi)‘^, 
which has the same image under h as w, and therefore L also contains w. □ 


Defining pseudovarieties by identities As mentioned in its proof. Theo¬ 
rem [nT] does not use any properties of recognisability over a monad. We now 
present a corollary of the theorem, which is more specific to monads, and which 
says that pseudovarieties can be defined by identities. 

To define profinite identities, we observe that both T-morphisms and unary 
polynomials can be naturally lifted to profinite objects, as described below. 
Suppose that 


/: A^B 


is a function, not necessarily a T-morphism, which has the property that recog¬ 
nisable languages are preserved under inverse images of /, i.e. 

L € recB implies f~^(L) € recA. (18) 

Then for every ultrafilter U of recognisable languages subsets of A, the family 
(Stone/)(C/) = {L e recB : /"^(L) S U} 
is an ultrafilter of recognisable subsets of B. In other words, / lifts to a function 

Stone/ : StoneA —> StoneB. 


One can show that the mapping Stone defined this way is a functor, whose 
domain is the category of T-algebras with functions that satisfy (18). We will 
be interested in two special cases of functions / with property (18), i.e. when 
/ is a T-morphism and when / is a unary polynomial. The following fact is an 
immediate consequence of the definitions. 
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Fact 11.2 /// ; A — 7> B satisfies (181, and L C B, then 


f ^{L) satisfies w ^ v iff L satisfies (Stone/) (w) — > (Stone/) (z;) 

Define a profinite identity to be an expression of the form w = v where 
w,v G StoneTA for some finite set X of variables. As in profinite implications, 
the equality sign is just part of the syntax, and formally a profinite identity is 
simply the pair {w,v). If A is a T-algebra, then we say that L G recA satisfies 
a profinite identity re = r; if it satisfies 

(Stone(p o h)){w) gg (Stone(p o h)){v) 

for every unary polynomial p G polj^A and every T-morphism : TA —> A, 
where GG means that the profinite implication is satisfied both ways. As men¬ 
tioned above, the mapping Stone is a functor, and therefore Stone(p o h) is the 
same as (Stonep) o (Stone/i). Intuitively speaking, for every substitution of the 
variables, i.e. every morphism li, and in every environment, i.e. for every unary 
polynomial p, the two sides of the profinite identity are equivalent. 

Example 11. Consider the monad of finite words, and the profinite identity 

xy = yx. 

Formally speaking, the profinite identity uses profinite words, call them “xy” 
and “yx”, which correspond to the finite words xy and yx, as described in the 
second paragraph of Example [T^ A recognisable language L C 1]+ satisfies this 
profinite identity if 

p~^L G “wx” iff p~^L G ‘W” 

holds for every unary polynomial p over E"*" and every w,v G S+. Unraveling 
the definitions of the profinite words “xy” and “yx”, this means that 

wvGp~^L iff vwGp~^L. 

This means that the language must be commutative. □ 


Example 12. Consider again the monad of finite words. Recall the profinite 
word that was described in Example 10 In a similar way, we can define a 
profinite word to be the ultrafilter 

{L C E“'' : L is recognisable and G L for all but finitely many n}. 


Consider the following profinite identity over a single variable x: 

X# = x^'*'^, 


which the reader might recognise as the identity defining aperiodic semigroups. 
We now check that this is the case under the definitions of this section. A 
recognisable language L C E"*' satisfies this profinite identity if for every unary 
polynomial p over the semigroup E+ and every w G E+, the following conditions 
are equivalent 
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• w"' belongs to p~^L for all but finitely many n; 

• belongs to p~^L for all but finitely many n. 

This implies that for every word w and unary polynomial p, the language p~^L 
contains either finitely many, or all but finitely many, of the powers re”. This 
means that the syntactic semigroup of the language is aperiodic, which means 
that the language is definable in first-order logic, by Schiitzenberger’s theorem. 
□ 

The above two examples showed that, in the monad of finite words, some 
classes of recognisable languages can be characterised via profinite identities. 
The following theorem, which is a monad version of the Reiterman Theo¬ 
rem |Rei82| . says that this is the case for all pseudovarieties, although infinite 
sets of identities might need to be used. Note that although profinite identi¬ 
ties can be evaluated in a recognisable subset of an arbitrary T-algebra, in the 
following theorem we talk only about T-languages, i.e. recognisable subsets of 
algebras of the form TE where E is a finite alphabet. 

The following theorem uses the polynomial variant of language pseudovari¬ 
eties that is mentioned in Sect ion [4^ i.e. this is a class of recognisable languages 
that is closed under polynomial derivatives, inverse morphisms, and Boolean 
combinations. 

Theorem 11.3 Let L be class of recognisable T-languages. Then h is a pseu¬ 
dovariety if and only if it is defined by a set of profinite identities. 

Proof. 

The right-to-left implication is essentially checking the dehnitions, while the 
left-to-right implication is a corollary of Theorem [TTt} 

Right-to-left implication. Suppose that / is a set of profinite identities, 
and let L be the set of recognisable T-languages which satisfy all of these iden¬ 
tities. We need to show that L is a pseudovariety. As mentioned in the proof 
of Theorem [mi satisfying profinite implications is closed under unions and 
intersections. It is easy to see that satisfying a profinite identity is invariant 
under complementation. Therefore, L is closed under Boolean combinations. It 
remains to show that L is closed under inverse images of morphisms and un¬ 
der polynomial derivatives. Let then L C TT be a language that satisfies all 
identities from /, and suppose that 

h : TE ^ TT 

is a T-morphism. We will show that h~^{L) C TE also satisfies all identities 
in I. (The proof for polynomial derivatives is the same and is ommitted.) By 
definition, we need to show that for every profinite identity w = v in I which is 
over variables X, and every 

/ : TA ^ TE 9 G pol^TE 
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which are a T-morphism and unary polynomial, respectively, we have 
h~^{L) satisfies {Stone{q o f ))(w) {Stone{q o f)){v) 


By Fact 11.2|and functoriality of Stone, the above is equivalent to saying that 


L satisfies (Stone(/i o g o/))(rc) o (Stone(/i o g o/))(ti), 


which is the same as saying that 

L satisfies (Stone(r o h o /))(zi;) e-)-(Stone(r o h o/))(«), 

where r S pol^Tr is the image of q under h, see Q. Since h o f is itself a T- 
morphism, the above holds by assumption that L satisfies all profintie identities 
from L. 


Left-to-right implication. Let L be a class of recognisable T-languages 
which is a language pseudovariety. We need to show that L is definable by 
profinite identitites. For a finite alphabet S, define Ls to be all languages from 
L over alphabet E, and let /s be the set of profinite implications that are satis¬ 
fied by all languages in L^. Define / to be the set of profinite identities w = v 
such that some Iy. contains the profinite implication w ^ v. We show below 
that L is defined by I, i.e. a language belongs to L if and only if it satisfies all 
identities in I. 


Suppose that L satisfies all profinite identities from I. In particular, this 
means that L satisfies all profinite implications from Iy ■ Since L is a pseu¬ 


dovariety, it follows that Ls is a lattice, and therefore by Theorem 11.1 


Ls is defined by the profinite implications from Iy, which means that L 
belongs to L^. 


• Suppose that L belongs to L. We need to show that L satisfies all profinite 
identities from I. In other words, we need to show that if tc —>■ u is a 
profinite implication from Jp, then 


L satisfies {Stone{p o h)){'w) {Stone{p o h)){v) 

for every unary polynomial p S polj^TE and every T-morphism h : TF —>■ 
TE. By Fact EH and closure propeties of a pseudovariety, this boils 
down to the profinite implications w v being satisfied by a language 
from Lp, which holds by definition of /p. 


□ 


11.3 Uniform continuity 

The results in Section |11.1| a nd |12.4| did not really use assumptions on recog- 
nisability. Actually, Theorem |11.1 would also be true for non-recognisable lan¬ 
guages, as shown in the following example. 
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Example 13. Consider the monad of finite words. For the purpose of this 
example, defin StoneS+ to be the Stone dual of the Boolean algebra of decid¬ 
able languages over the alphabet E, as opposed to the recognisable languages 
considered in the previous secition. Also for the purpose of this example, de¬ 
fine a pseudovariety to be a class of decidable languages that is closed under 
Boolean combinations, inverse morphisms and polynomial derivatives, e.g. the 
polynomial time complexity class P is such a pseudovariety. Inspection of the 
proofs in Section En] shows that Theorem |11.3| would also work in this setup, 
in particular P is definable by profinite identities. □ 

In this section, we show that recognisable languages are special in some 
sense. The result in this section is a generalisation of Theorem 4.1 in |GGP10) 
from semigroups to a certain class of monads over sets. 

Uniformly continuous operations. We begin by defining the notion of a 
uniformly continuous operation in a T-algebra, with respect to a chosen class of 
languages. Let 

£ = {Li,L2, ■ ■ ■} 

be a countable family of subsets of a set A, along with some enumeration. We 
say that L G C separates two elements of A if it contains exactly one of them. 
Define the /1-distance on A to be 

dist£(a, 5) = — where n is minimal such that separates a,b. 

2 ” 

It is easy to see that this is a distance, assuming that every two elements of A are 
separated by some element of C. Note how countability is used in the definition. 
Unravelling the classical definition of uniform continuity, a function 

/ : A" ^ A 

is uniformly continuous with respect to £-distance if for every finite set A C £ 
there is some finite set A4 C C such that 

f\vi=MWi implies f{vi,...,Vn)=Kf{wi,...,Wn) 
i 

where =k: says that elements cannot be separated by languages from /C, and 
=M is the analogous equivalence but lifted pointwise to functions. It follows 
that although the definition of £-distance depends on the enumeration of £, the 
notion of uniformly continuous function does not. 

The goal of this section is to investigate conditions on £ which guarantee 
that all polynomials of finite arity define uniformly continuous functions. The 
answer will be that £ needs to contain only recognisable languages. We begin 
by two examples, which show the result for the special case of semigroups. 

Example 14. Let S be a finite alphabet, let A be a set of finite semigroups 
(e.g. all finite semigroups, or all aperiodic semigroups), and consider the £- 
distance on E+ where £ is all subsets of E+ recognised by semigroups in X. We 
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claim that concatenation, which can be seen as a binary polynomial 

{w, v) I—t wv 

is uniformly continuous with respect to >C-distance. We need to show that for 
every finite 1C C C there is some finite A4 C C such that 

wi =M W 2 and vi =m V 2 implies wiUi =c W 2 V 2 - 

holds for every words wi,W 2 jVi,V 2 C The languages A4 can be taken 

to be all languages recognised by those semigroups that are used to recognise 
the languages from K,. The family Ai is finite because there are finitely many 
possible semigroup morphisms from to a finite set of finite semigroups. 
The same solution works for other operations in E"*" that can be built using 
concatenation. □ 

Example 15. As in the previous example, consider the ^-distance on E+ 
where C contains some language that is not recognisable. We show that with 
respect to /1-distance, concatenation might continuous, but not uniformly con¬ 
tinuous. 

To show that concatenation might be continuous, suppose that C contains 
all singleton languages, e.g. C is the decidable languages. This implies that 
the topology generated by /1-distance is discrete, because every singleton set is 
open. Therefore, the topology on finite powers of E+ is also discrete, and thus 
concatenation is continuous with respect to /1-distance, like any other operation 
on this semigroup. 

Let us now show that concatenation is not uniformly continuous. Consider 
some non-recognisable language L G C. We will show that there is no finite set 
M of decidable languages such that 

wi =M W2 and vi =m V2 implies wiVi ={l} W2V2- 

for every words wi,W 2 ,vi,V 2 G E+. Let then be a finite set of languages 
from C, or any languages for that matter. Because L is not recognisable, there 
are infinitely many left derivatives, i.e. languages of the form x~^L. Since there 
are finitely many equivalence classes of =m^ there must exist some two words 
Wi , W2 such that 

wi =M V 2 and Wi^L ^ w^'^L. 

The inequality of derivatives means that there is some v such that 

WlV W 2 V, 

which proves that concatenation is not uniformly continuous. □ 

The two examples above are essentially Theorem 4.1 of [GGP10| . The goal 
of this section is to generalise that result to algebras over abstract monads. 
The role of concatenation will be played by polynomials of finite arity. In our 
generalisation we assume that the monad is finitary, and that it is over the 
category of (unsorted) sets. The proof can be easily generalised to finitely 
sorted sets. There is one additional assumption in our generalisation, which 
will require some more definitions. 
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Observationally complete polynomials. The idea behind observational 
completeness is that sometimes, instead of using all unary polynomials in the 
definition of the syntactic congruence, one can use a smaller subset, e.g. unary 
polynomials that use the variable only once (whatever that may mean in an 
abstract monad). 

Let A be a T-algebra. We write pol„A for polynomials with n argumetns in 
the algebra A; we do not need to indicate the sorts of these arguments because 
we use unsorted sets. We use the convention that the variables in a polynomial 
from pol^A are called cci,..., Xn- Therefore, formally 

pol„A = T(AU {a;i,... ,a;„}). 

A set P C poIjA is called observationally complete for A if the following con¬ 
ditions are equivalent for every a, b in the universe of A and every subset L of 
the universe of A: 


wGp iff w'^p^L for every p G pol^A (19) 

wGp~^L iff w'Gp~^L for every p G pol^A n P. (20) 


Recall that the condition in (191 is the equivalence relation defined in the proof 
of the Syntactic Morphism Theorem. 


Example 16. Consider the monad of finite words where algebras are semi¬ 
groups. Call a unary polynomial nonduplicating if it uses its variable exactly 
once. Such a polynomial is of the form wxiv where w^v are possibly empty 
words over the universe of the semigroup. Without loss of generality one could 
also assume that w, v have length zero or one. It is not difficult to show that the 
unary nonduplicating polynomials are observationally complete in every semi¬ 
groups. □ 


Running Example 10. Consider the monad of ultimately periodic oo-words. 
It is not difficult to show that in every algebra A for this monad, an observa¬ 
tionally complete set of unary polynomials is 

{wxiv,w{xiv)‘^ : where w, u G A*}. 

These unary polynomials correspond to the Arnold congruence from |Arn8fi| . 

□ 


Finite covers. Define an n-ary term to be an element of T{a;i,... ,x„}, where 
Xi,... ,Xn are the variables used for polynomials. In other words, a term is the 
special case of a polynomial that does not use any constants, and therefore 
an n-ary term is an n-ary polymomial in every algebra. We say that a unary 
polynomial p G pol^^A is covered by an n-ary term q if there exist 02 ,..., o„ in 
the universe of A such that 


p = q{xi,a2,...,an). 
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A set P C poIjA is said to have a finite cover, if there is a finite set Q of terms, 
of possibly different arities, such that every polynomial in P is covered by some 
term in Q. 


Example 17. The nonduplicationg polynomials mentioned in Example 
are covered by the 3-ary term 12 X 10 : 3 . The nonduplicationg polynomials men¬ 
tioned in Running Example 10 are covered by the two 3-ary terms X 2 X 1 X 3 and 
X 2 (xiX 3 )“. Summing up, in both these monads, every algebra has a finite cover 
for the set of nonduplicating unary polynomials. □ 


Characterisation of uniformly continuous term operations. We are 

now ready to state the theorem that characterises recognisability as a neces¬ 
sary and sufficient condition for uniform continuity of term operations. In the 
theorem, we write der£ for the set of all polynomial derivatives of languages 
from C. 


Theorem 11.4 Consider a finitary monad T in the setting of sets. Let A. he a 
finitely generated T -algebra which has an observationally complete set of unary 
polynomials that has a finite cover. Let C he a countable family of subsets of the 
universe of A. Then 

1. if C contains only T-recognisable languages, then every term operation is 
uniformly continuous for derC-distance; 

2. if C contains at least one language that is not T -recognisable, then some 
term operation is not uniformly continuous for C-distance. 

Before proving the theorem, note that by the discussion in Example |17[ the 
assumptions of the theorem are satisfied by every algebra in the monad of finite 
words, and by every algebra in the monad of ultimately periodic words. 

Proof. 

We skip the proof of the first item, which is proved as in Example and does 
not use the assumption on the observationally complete set of unary polynomi¬ 
als, but uses the assumption on finite generation. 

Let us consider the second item. Let P be a set of observationally complete 
polynomials with a finite cover Q, as in the assumptions of the theorem. Let 
L G Che some language that is not recognisable. Since A is finitary, we can use 


the Syntactic Morphism Theorem. It follows that the equivalence in (19) has 


infinite index, and therefore the equivalence relation defined in (201 as applied 
to P has infinite index. For a unary polynomial q in the finite cover Q, define 


to be the relation as in ( 20 ) but with polynomials restricted to those that 
are covered by q, i.e. identifies a,b G A ii 


a G p 




iff 


b G p ^L for every p G polj^A covered by q. 


Since the relation (20) has infinite index and is the intersection of the finitely 
many relations '^q, there must be some n-ary term q G Q such that has 
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infinite index. We claim that q, when seen as a polynomial in pol„A, is not 
uniformly continuous. To prove this, consider some finite set /C C £. Because 
the index of is infinite and the index of =ic is finite, there must be some 
a,b G A such that 


a '/‘q b and a =k: b. 

Unraveling the definition of this means that there are some 02 ,..., a„ in 
the universe of A such that 

g(a,a2,...,a„) 9(6,02,..., a„), 

which proves that q is not uniformly continuous. □ 


12 Profinite monads 

In this section, we prove that the Stone dual considered in the previous section 
has sufficient structure to make it into a monad, i.e. for every monad T the 
mappings 

S i-G Stone(TS) 

/ : E —)• r !->• Stone(T/) 

can be equipped with unit and multiplication to make it a monad, which we 
will denote by T. Because T is a monad, it has its own notion of recognisability, 
which is related to but richer than the notion of recognisability of the original 
T. This richer notion of recgnisability is studied in Section [T^ on the example 
of profinite words. 

12.1 Definition of the profinite monad 

Fix for the rest of this section a monad T, in the category of sets, the general¬ 
isation to sorted sets being straightforward. We explain how to convert T into 
a monad, which we denote by T, that describes profinite objects over T. 

Types. Instead of Stone duals as studied in the previous section, we will use 
in this section an alternative definition, which has a more algebraic flavour. For 
a T-algebra A, not necessarily finite, define a T-morphism type over A to be a 
function r which maps every surjective T-morphism 

/i : A —!• B with B finite (21) 

to an element h'^ G B, subject to the condition that 

{g o hy = g{h'^) for every h : A —>■ B and g : B —> C (22) 
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where 5 is a surjective T-morphism between finite T-algebras. The set of of T- 
morphism types over a T-algebra A is called its compactification, and is denoted 
by A. As a topological space, the set of T-morphism types is an equivalent 
definition of the Stone dual defined in the previous section, as stated in the 
following fact. 

Fact 12.1 If A is a T-algebra, then StoneA is homeomorphic to A, assuming 
that the base open sets are of the form 

{t G A: =b} 

for h : A —> B a surjective T-morphism into a finite T-algebra and 6 G B. 

Nevertheless, we use T-morphism types instead of the Stone dual because 
they will be more convenient to study the algebraic structure. Define the profi- 
nite extension of a surjective T-morphism 

h-.A^B 


into a finite T-algebra B to be the function 

h:A-^B 


defined by ^(t) = h'^. In terms of profinite extensions 
following diagram commutes. 


( 221 , 


says that the 



(23) 


Example 18. Consider the monad -f of finite words, where -I—algebras are 
semigroups, and -I—morphisms are semigroup morphisms. Consider the semi¬ 
group where E is a a finite alphabet. As stated in Fact 12.1 -I—morphism 


types, or semigroup morphism types, over E+ are the same thing as elements 
of the Stone dual StoneE+, which are profinite words as described in Exam¬ 
ple We now revisit the element of the Stone dual that was described in 
Example [lOl and describe its corresponding semigroup morphism types. 

By definition, a semigroup morphism type over S’*" is a function which maps 
every semigroup morphism 

h : S’*" —> A with A a finite semigroup 

to an element of A, in a way that is c onsi stent with composition. The idem- 
potent power described in Example 


10 


is the morphism type which maps a 
morphism h to h{w)'^. Let us check that defined this way is indeed a semi¬ 
group morphism type, we need to show that for every semigroup morphisms 


h : S* 


5: A-^B 
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with A and B being finite semigroups, and g being surjective, we have 

g{h{w)*) = {goh{w))*. 


This is checked below, assuming that n and m are the sizes of A and B. 


g{h{w)*) 

gihiwT') 

g{h{w)r 

{mw)r- 

{goh{w))*. 


(by definition) 

(because y is a semigroup morphism) 

(because m < n and m! is an idempotent powe) 
(by definition) 


□ 


The functor of T. We now define the profinite monad T. We assume that 
the unit and multiplication in the original monad T are denoted by rj^ and /iy;. 
An object S is mapped by T the compactification of the T-algebra TS, i.e. 

Te = Te. 


The remaining components of the monad, i.e. how T acts on functions, as well 
as the unit and multiplication operations, are defined and proved correct in the 
following theorem. 


Theorem 12.2 There are unique operations 


T/ 

TT ^ TE 


E ^ TE 


TTe ^Te 


for f:T^E 


such that for every finite T-algebra A and every surjective T-morphism 


h : TE ^ A, 


into a finite T -algebra, 


the following diagrams commute 




Furthermore, equipped with the above operations, T is a monad. 


The rest of Section |12.1| is devoted to proving the above theorem. First 
observe that the operations from the statement of the theorem, if they exist, 
are uniquely specified by the diagrams in the statement of the theorem, because 
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an element of TE is uniquely specified by its values under all possible profinite 
extensions h. We need to check that the operations actually produce morphism 
types, i.e. the values that they produce satisfy (22). Let us first check that T/ 
produces types, i.e. that 


go/ioT = g ohoT f 
holds for every finite T-morphisms 

h : TE —>■ A g : A —> B 


where A, B are finite. This is checked below: 

g o hoT f = (by definition of T/) 
gohoTf = (by ([^) 
g o hoT f = (by definition of T/) 
gohoTf 

For the unit operation, the check is even simpler: 

g o h ofjY, = (by definition of fjT. ) 
g o h orjY. = (by definition of fjT.) 
gohofj^ 

Before checking that the multiplication operation defined in the theorem pro¬ 
duces types, we check that T is a functor, i.e. 

T{fog) = TfoTg for every / : A — E and g : F —)■ A. 

To prove the above equality, we show that the two sides of the equality are equal 
after being composed with functions of the form h with 

h : TE ^ A 


a T-morphism into a finite T-algebra. This is checked below and illustrated in 
Figure 


hoT{fog) 
hoT{fog) 
h oT f o T^ 
h oT f o T^ 
hoT f oTg 


(by definition of T(/ o g)) 
(because T is a functor) 
(by definition of Tg) 

(by definition of T/) 
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TE 



Figure 5: T is a functor. 


Multiplication in T To prove that the multiplication operation of the monad 
T is produces types, one uses the following lemma in the special case of A = TE. 


Lemma 12.3 If A is a T-algebra, then there is a unique operation 


mul^ : TA 


which makes the following diagram commute 

TA-^- 

Th 

Y 

Tb 


mulA 


y 

B 


for every T -morphism /i : A —>■ B into a finite T -algebra B. 

Proof. 


(24) 


The diagram (241 leaves no choice in the definition of mul^, since an element of 
A is uniquely defined by its images under all possible h. We check below that 
the multiplication operation is well-defined, i.e. it produces T-morphism types. 
Let then 


h : A-^B g :B C 

be T-morphisms with B and C being finite T-algebras. We need to show that 

g o ho = g oho 

This is done below and illustrated in Figure 
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mul^ 


TA 


A 



mule 


Figure 6: Multiplication in T is well-defined. 


goho 

mule oT g o h 
mule oT{g oh) 
mule o Tg o Th 
mule o Tg o Th 
g o mule o T/i 
g o mule oTh 
g o ho omul^ 


(by @) 

(by ( [^ ) 

(because T is a functor) 

(by definition of Tg) 
(because g is a T-morphism) 


(by (231) 


(by (241) 


□ 


So far we have proved that the operations in the statement of Theorem |12.2| 
are well defined, i.e. they produce T-morphism types, and that T is a functor. 
We now check the remaining axioms of a monad. We skip proving that mul¬ 
tiplication and unit are natural, i.e. the upper two diagrams in Figure We 
only show that multiplication is associative and consistent with the unit, i.e. the 
lower two diagrams in Figure 

To prove that the multiplication operation in the monad is associative, we 
apply the following lemma to the special case of A = TE. 


Lemma 12.4 Let A be a T-algebra, and let mul^ 
the following diagram eommutes: 


he as in Lemma 12.3 


Then 


TTA > TA 


Tmul^ 

I 


TA 


mul^ 


mul^ 

I 


A 
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Proof. 

Because an element of A is uniquely determined by its values under h, with h 
ranging over T-morphisms from A into finite T-algebras, it suffices to show that 
the diagram commutes when extended with such a h, i.e. 

h o mul^ o T mul A = ho mul^ o . 

Let us then fix ft, : A —?> B and prove the above equality. The calculation is 
performed below and also illustrated in Figure 



Figure 7: The four-sided faces in the diagram commute by (24), or by T applied 
to (24). The three-sided faces in the diagram commute by the definition of T 
on functions, or by T applied to the definition of T on functions. 


ft o mul^ o T mul^ 

= (by (l24|) 

mule o Tft o Tmul^ 

= (by T applied to (|24|)) 

mule o T mule o TTft 

= (because T is a functor) 

mule o T (mule o Tft] 

= (by definition of Tft) 

mule o Tmule o Tft 

= (by (24l with the T-morphism being mulB o Tft) 

mule o Tft o 

= (by definition of Tft) 

mule o Tft o 

= (by (24)) 

ft o mul^ o 



□ 
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We now check the last axiom of a monad, namely that the following diagram 
commutes: 


— 'ffv - 

TE —^ TTS 

TTs-^Ts 

PE 



Let us first check the upper triangular face of the diagram: 


ho fi^o 

mulA oTh o ry=j= 5 ] 
mulA oTh o ry=j=j, 
mul A o Th o 77=j=s 
mulA o TjAoh 
h 


(by @) 

(by definition of Th) 

(by definition of ?7 e) 

(becanse T is a monad) 

(becanse mulA is the identity on units) 


Let us now check the lower triangular face of the diagram: 


/i o /2s o Tunit 
mulA o T/i o Tunit 
mulA o T(/i o unitjj,) 
mulA o T(/i o unitjE) 
mulA o T/i o TunitjE 
mulA o T/i o Tunitjs 
mulA o T/i o TunitjE 
ho fj,-^ oT unitjs 
h 


(by (241) 


(because T is a functor) 

(by definition of r^s) 

(because T is a functor) 

(by definition of Th) 

(by definition of T?7 e) 
(because /i is a T-morphism) 
(because T is a monad) 


This completes the proof that T is a monad. 


12.2 Prom a T-algebra to a T-algebra. 

Having defined the monad T, it is natural to ask what are finite T-algebras, 
and what are the languages recognised by them. In this section, we discuss how 
every T-algebra can be transformed into a T-algebra. Since this transformation 
preserves finiteness, it gives a source of examples of finite T-algebras. However, 
the algebras produced by this transformation are not very interesting, because 
they are essentially decorations of T-algebras. More interesting examples will 
be given in Section 

Prom A to A. An element of a S A can can be interpreted as an element of 
A, namely as the T-morphism type which maps a T-morphism h to h{a). We 
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denote this interpretation by tA, by definition it makes the following diagram 
commute: 


. (25) 

h 

B 

for every T-morphism h into a finite T-algebra B. It is tempting to think of (.a 
as an embedding. However, for 6 a to be an embedding, one would require that 
every distinct elements of A can be distinguished by some T-morphism into a 
finite T-algebra. This additional assumption is true in all monads studied in this 
paper, at least for finitely generated T-algebras, but it can be false, e.g. with a 
very restrictive notion of finite T-algebra. 



An algebraic structure on A. From Lemmas 12.3 and 12.4 
if A is a T-algebra, then there is a multiplication operation 


it follows that 


mulA : T A —5- A 


which turns the compactification A into a T-algebra. The following lemma 
implies that compactification preserves flniteness of algebras. 

Lemma 12.5 //A is a finite T-algebra, then A is isomorphic to the T-algebra 
where the universe is the universe of A, and multiplication is defined to be the 
profinite extension o/mulA, i-e. by 

mulA : TA A. 


Proof. 

We claim that the isomorphism is the profinite extension 

idA ■ A —y A 


of the identity on A. We claim that the above is a bijection, because its inverse 
is 6 a- To prove bijectivity, we need to show that 

6A o idA idA o tA 


are the identity functions on A and A respectively. For the latter, we in¬ 
voke (25). The former is explained in the following diagram 


A^A 


h 

B 



lA 


A 


□ 
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Lemma 12.6 // : A —^ B is a T -morphism, then there is a unique function 

h:A^B 


which makes the following diagram commute 


A— 



for every T -morphism g : B —> C with C finite. 

Proof. 

Note that the definition of T/ is actually a special case of this lemma, because 
T/ is makes the diagram in the lemma commute for T/, i.e. T/ = T/. The 
lemma is proved the same way as we proved that T/ is well defined. □ 

The above lemma introduces a little clash of notation. If 


h:A^B 

is a T-morphism such that B if finite, then h has two definitions: namely the 
profinite extension of A, which is of type A — >■ B, and the definition from the 
above lemma, which is of the type A — B. However, the two definitions are 
essentially the same mapping, because they are equal up to the isomorphism 
from Lemma Fl 2.5 1 


is a T-morphism and makes the following diagram commute: 


Lemma 12.7 If h : A ^ B is a T-morphism, then h defined in Lemma \12.(^ 


A—^B 


I'A 

A 


h 


I'B 

B 


Proof. 

We first check the diagram in the statement of the lemma. It suffices to show 
that the diagram commutes after the lower right corner is extended with the 
profinite extension of a T-morphism / : B —C into a finite T-algebra. This is 
shown in the following diagram: 


A- - -^B 
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The proof that h is a T-morphism is in the following diagram. 



The upper and l ower faces commute by Lemma|12.3[ the right face commutes 


by Lemma 


12.6 


Lemma [12.61 □ 


and the left face commutes by applying the functor T to 


12.3 Prom a T-algebra to a T-algebra. 

In the previous section, we showed how to convert a T-algebra into a T-algebra. 
We now discuss the opposite direction. To go from a T-algebra A to a T-algebra, 
call it Aj, one keeps the same universe and defines the multiplication operation 
by 





mulA 

Y 


A 


The following lemma shows that this construction is correct. In the specific 
case of the monad of finite words, the lemma says that a profinite semigroup is 
actually a semigroup (it has other structure as well). 

Lemma 12.8 If A is a T-algebra, then Aj is a T-algebra. // h : A —>■ B is a 
T-morphism, then the function underlying h is a T-morphism from Aj to Bj. 

Proof. 

By Lemma 

Ttjs : TTE ^ TTE 


12.7 applied to h being 


we see that the following diagram commutes 


TTE TTE 


TTE 


TTe 
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Let us write its for the diagonal of the above diagram. 

To prove that Aj is a T-algebra, we will show that the following diagram 
commutes (the outer perimeter of the diagaram says that mulAy is associative 
as required in a T-algebra): 


TTA 




TA 



ihuIa^ 


The middle face of the diagram is the assumption that A is a T-algebra. The 
right and bottom faces are the definition of mulAy The top face can be shown 
using the definition of multiplication in TA, and does not use the algebraic 
structure on A. Finally, for the left face, we use the following diagram: 

TTA 



The rectangular face commutes by Lemma 12.7 The lower triangular face 
commutes by applying the functor T to the definition of mulAj- The upper 
triangular face commutes because it is the definition of lla- 

This completes the proof that Aj is a T-algebra. To prove that h as in the 
statement of the lemma is a T-morphism, we consider the following diagram: 


TA- - - ^TB 



The left and right faces commute by definitions of multiplication in Ay and Bj. 
The bottom face commutes by assumption that /i is a T-morphism. The top 
face commutes by Lemma [T^ □ 
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12.4 Clopen languages and Stone algebras 

A well known result for proflnite words is that there is a one-to-one correspon¬ 
dence between clopen subsets of the proflnite monoid over S, and recognisable 
subsets of £■*■. In this section we prove Theorem 12.9 which generalises this 
observation to monads. 

Recall that if A is a T-algebra, then by Fact 12.1 there is a topological 


structure on A, which is homeomorphic to the Stone dual StoneA. Recall also 
the mapping (.a : A —> A. Using these two notions, for L C A, we define 
L C A to be t he clo sure, in the topology of A, of the image of L under the 
6 a- In Theorem 12.9 we will show that the clopen subsets of A are exactly the 
closures, in the sense just defined, of T-recognisable subsets of A. 

Theorem 12.9 Let A. he a T-algebra. A subset of A is clopen if and only if it 
is of equal to L for some recognisable L C A. 


Before proving the theorem, we present a lemma. 


Lemma 12.10 If h : A ^ 'B is a T-morphism into a finite T-algebra, then 
h~^{F) = (h)~^{F) for every F C B. 

Proof. 

Recall that a base open set in A is a set of the form 

h~^[b) for some T-morphism h : A —^ B and b G B, 

and such sets are also closed. Therefore, the set on the right side of the equality 
in the statement of the lemma is closed, as a finite union of base sets. To 
complete the proof, we show that the image of h~^{F) under 6 a is dense in 
the set on the right side, i.e. every open subset of the right side contains 6 a( a) 
for some a G A with h{a) G F. Every open set contains a base open set, and 
therefore it suffices to show that if 


g:A^C 


is a T-morphism into a finite T-algebra, and the base open set g~^{c) is included 
in the right side of the equality, then g~^{c) contains 6 a (a) for some a G A with 
h{a) G F. For a it suffices to choose any element of g~^{c), which is easily 
shown to belong to h~^{F). □ 


Proof, (of Theorem 12.9) 

Let us begin with the left-to-right implication. Consider a clopen subset of A. 
Like any clopen set in a compact space, this is a finite union of base open sets. 
By definition of closed base open sets in A and Lemma 12.10 we see that every 
clopen subset of A can be represented as a finite union 


\Jh-\F,) =[}h-\F,). 

i i 

The right side is as required in the statement of the theorem. The right-to-left 
implication is done by reversing the above reasoning. □ 
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Stones In Section 12.2 we showed how to convert T-algebras into T-algebras, 
and how to convert T-morphisms into T-morphisms. We now explain that the 
T-algebras and T-morphisms produced this way have special topological prop¬ 
erties. 

A T-algebra A is called Stone if its universe is finite, and the multiplication 
operation is continuous assuming the discrete topology on the universe. The 
reason for this name is that the discrete topology is the only one which makes 
the finite universe a Stone space, i.e. a compact totally disconnected Hausdorff 
topological spac^ As shown in the following lemma. Stone T-algebras are 
essentially the same thing as finite T-algebras. In Section we will show 
examples of finite T-algebras that are not Stone. 


Theorem 12.11 Up to isomorphism, the mappings 


h : A 


A 

B 


!->• 


A 

h-.k^B 


are one-to-one correspondences between, respectively: 

• finite T -algebras and finite T -algebras that are Stone; and 

• T -morphisms into finite T -algebras and continuous T -morphisms into fi¬ 
nite T -algebras that are Stone. 

Proof. 

We only consider the first mapping, the second is proved in a similar way. We 
will show that if A is a finite T-algebra then A is Stone, and if A is a Stone, 
then At is a finite T-algebra. In Lemma flZ.SI we have shown that if A is a finite 
T-algebra, then A is isomorphic to the algebra whose universe is the universe 
of A, and whose multiplication operation is 

mulA : TA —>■ A, 

i.e. the profinite extension of the original multiplication. Every profinite exten¬ 
sion is continuous (assuming the discrete topology on the image) by definition 
of the topology in A, and therefore A is Stone. To prove that the correspon¬ 
dence is one-to-one up to isomorphism, we need to show that if A is a Stone 
T-algebra then it is isomorphic to Ay, and if A is a finite T-algebra then it is 
isomorphic to (A)t. We only prove the former isomorphism. Let then A be a 
Stone algebra. By Lemma [12. 5[ it suffices to show that the two multiplication 
operations 


mulA : TA —>■ A 

ihuIat : TA —A 

^Actually, already the Hausdorff requirement implies discreteness, but Stone spaces are 
closely connected to profiniteness. 
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are equal. By definition, the two operations agree on elements in the image of 
TA under 


i'j A ■ A —y T.A. 


This image is a dense subset of TA. Because A is finite, TA is a metric space 
(we implicitly assume that there are countably many finite T-algebras up to 
isomorphism), and therefore both multiplication operations are uniformly con¬ 
tinuous functions that agree on a dense subset. Such functions must be equal. 
□ 


13 Profinite words 


In this section, we illustrate the profinite monad construction from Section 
in the special case of words. Consider the monad E i—E+ of finite words. 
Let us den ote by E i—> E’*' the profinite version of this mona d, as defined in 
Section 12 In particular, E+ is a semigroup thanks to Lem ma 12.8 and E+ is 

An element of E+ 


12.1 


has a topology which makes it a Stone space by Fact 
is called a profinite word over the alphabet E. One of the results of this section 
is Theorem |13.3[ which implies that MSO is undecidable over profinite words, 
already with two predicates. 

every finite semigroup S, can be extended to a 


As shown in Section 12.2 


-I—algebra S with the same universe, where the multiplication operation 


mulo : S~^ 


is continuous assuming the profinite topology on the domain and the discrete 
topology on the image. In this section we give an example of a finite +-algebra 
that is not obtained this way, because the multiplication operation is not going 
to be continuous assuming the discrete topology on the image. 


13.1 The unboundedness language 

We say that a profinite word w € E’*' has at least n letters in a subset T C E if 
it has value n under h where 

: E+ ^ {0, l,...,n} 

is the semigroup morphism which counts the number of letters in T up to thresh¬ 
old n. A profinite word is said to have exactly n letters from a set if it has at 
least n letters from the set but not at least n -|- 1. If a profinite word has at 
least n letters in T for every n, then we say that it has an unbounded number 
of letters in T. 

Lemma 13.1 The set of profinite words in {0,1}“'' which have unboundedly 
many ones is +-recognisable. 
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Proof. 

We show that the set in the statement of the lemma is recognised by a +- 
morphism 

h : {0,1}+ ^ A 

where A is the finite +-algebra defined as follows. The universe of A has 
three elements, call them 0,1 and oo, which represent profinite words that have 
zero ones, a bounded number of ones, and an unboundedly number of ones 
respectively. The multiplication operation 

mulA : A+ —>■ A 

is defined as follows. If the argument has only zeros, the value is zero. If the 
argument has at least one letter oo, or unboundedly many ones, then the value 
is oo. Otherwise the value is one. Note that the multiplication operation is 
not continuous, at least assuming a discrete topology on the universe, because 
the inverse image of 1 is not closed. We now prove that this multiplication is 
associative, i.e. that the following diagram commutes: 


(A+)+ 

(hiuIa)^ 

A+ - 


mulA 


■A+ 

mulA 

► A 


where fiA denotes the multiplication operation of the profinite monad. 

To prove that the above diagram commutes, we need to show that 

mulA((mulA)+(w)) = mulA(Mn(w))- (26) 


holds for every profinite word of profinite words w S (A+)+. We consider two 
cases, depending on whether w has an unbounded number of letters in the set 
A+ -0+. 


The word w has an unbounded number of letters outside 0+. We will show 
that (26) holds, because both sides are equal to oo. Consider first the left 
side. Let 


hn ■ A+ —>■ {0,..., n} 

be the semigroup morphism that counts the number of nonzero letters. 
Our assumption on w says that (/ii)+(w) has unboundedly many ones. 
Since the image of hi is a subset of A, it makes sense to compare values 
of hi with values of mulA, in particular the following observation is easy 
to get: 

hi{v) < mulA('c) for every v S A+. 
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As in the proof of Lemma |6.2[ a binary relation R C X x Y lifts to a 
relation C TA x TY. Apply this construction to the natural ordering 
on A, and call < the resulting relation on A+. As we have observed, 

(hi)^{w) <+ (mulA)^(w). 


The profinite word on the left of the above inequality has an unbounded 
number of ones by our assumption, and therefore it is mapped by mulA 
to oo. It is not difficult to see that the mapping mulA is monotone with 
respect to <, and therefore the left side of the equality in (26) is oo. 


To prove that the right side of the equality in (26) is also oo, by definition 
of mulA we need to show that every n satisfies 

h„{p.Aiw)) = n. 


Let mul„ be the multiplication operation in the semigroup {0, 
Theorem |12.2| says that 


A++ 


^TA 


T{0,... ,n} 


muln 


hn 


^{0, ...,n} 


To prove that the right side of the equality in (26) is also oo, from the 
definition of mulA we need to show that for every n, if start with w and 
consider the right-down path in the above diagram, then we get n. Because 
the diagram commutes, we can also consider the down-right path. Our 
assumption on w says that (hn)~^{w) is a profinite word which has an 
unbounded number of nonzero letters. On such words, hn gives result n. 


• The other case is when w has a bounded number of letters outside 0+. We 
begin with a straightforward lemma, which uses the semigroup structure 
of profinite words that was described in Lemma [12. 8| Let us denote the 
unit of the profinite monad by rys, i-e. if a G S then ?7 e(«) G S’*' is the 
corresponding profinite word. 


Lemma 13.2 If w G S’*" has a bounded number of letters m T C E then 
it admits a finite decomposition 

w = wo- r?s(ai) • wi • • • Wn-i ■ f?s(a„) • w„ 

where wq, ... ,Wn are profinite words over the alphabet E — T, ai,... ,an 
are letters in T, and the dot stands for concatenation in the profinite semi¬ 
group. 


By applying Lemma 13. 2[ there is a decomposition 


W = Wo ■ 'nA+ {ai) -Wi - ■■ Wn-l ■ PA+ (a„) • Wn 


88 

















where Wi G (0''')+, G A+ — 0+, and the dot is concatenation in the 
profinite semigroup over alphabet A^. Lemma 


12.8 


implies that 


^iA{w) = HAiwo) ■ ai ■ ^iAiwi) ■ ■ ■ ^A(Wn-l) ' an ' fJ-A{w„) 


where the dot is concatenation in the profinite semigroup over alphabet 
A. Since mulA is a semigroup morphism, and it maps words in O’*" to the 
identity in A, it follows that 


hiuIa(/ iA('i«)) = niulA(ai) • • • mulA(a„). 
Let us now consider mulA((mulA)~'’('!r'))- Lemma 


12.8 


says that (mulA) 


is a semigroup morphism, and therefore mulA'''('a>) is equal to 
mulA’^(wo) • mulA^(^A+(ai)) ' mulA’^(iCi) • • • mulA’^(w„-i) • mulA'‘’(?7A+(an)) ' mulA"^(«;„). 


By the axioms of a monad, we have 

mulA''"(j7A+(«i)) = ^A(mulA(ai))- 

Each word mulA~''(rCi) in the decomposition of mulA'''(rc) belongs to 0+. 
Therefore, because mulA is a semigroup morphism that maps O’*' to the 
identity, we get 

mulA(mulA'''(w)) = mulA(j7A(niulA(aj))) • • • mulA(j7A(niulA(a„))). 

The result follows because mulA (^a (a)) = a holds for every a G A. 

This completes the proof that mulA : A^ —> A is a +-morphism. □ 

Let us define MSO+inf to by applying the abstract notion of MSO defined in 
Section |6.1[ with the base predicates being the language of unboundedly many 
ones from the previous lemma, and the profinite closure of the language “some a 
comes before some 5”. This class of languages of profinite words was considered 


it follows that MSO+inf contains only +-recognisable languages. It is not clear 
if it contains all +-recognisable languages. 


in [Tor 11] and |Torl2j . adjusting for the monad terminology. From Lemma 6.2 


Theorem 13.3 The satisfiability problem for MSO+inf is undecidable. 


Proof. 

Consider MSO+U on infinite words, which is an extension of MSO. This logic is 
shown undecidable in |MB15j . Corollary 2 of [Tor 12] shows that decidability of 
MSO+U on infinite words reduces to decidability of MSO+inf on profinite words. 
□ 
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